A POLYPEPTIDE COMPRISING THE AMINO ACID OF AN N-TERMINAL CHOLINE BINDING
PROTEIN A TRUNCATE, VACCINE DERIVED THEREFROM AND USES THEREOF

FIELD OF THE INVENTION

The present invention relates generally to a polypeptide of a N-terminal choline binding
protein A truncate. The invention also relates to vaccines which provide protection or
elicit protective antibodies to bacterial infection, especially pneumococcus, and to
antibodies and antagonists against such polypeptide for use in diagnosis and passive
immune therapy. The polypeptide and/or the nucleic acid encoding the polypeptide are
also used as a competitive inhibitor of bacterial adhesion of pneumococcus. Lastly, this
invention is directed to therapeutics using the polypeptide.

BACKGROUND OF THE INVENTION

Streptococcus pneumoniae is a gram positive bacterium which is a major cause of invasive
infections such as sepsis, meningitis, otitis media and lobar pneumonia
(NEJM 322:1280-1284, 1995). Pneumococci bind avidly to cells of the upper and lower
respiratory tract. Like most bacteria, adherence of pneumococci to human cells is
achieved by presentation of bacterial surface proteins that bind to eukaryotic
carbohydrates in a lectin-like fashion (Cundell, D. & Tuomanen, E. (1994) Microb.
Pathog. 17:361-374). Pneumococci bind to non-inflamed epithelium, a process that can
be viewed as asymptomatic carriage. It has been proposed that the conversion to invasive
disease involves the local generation of inflammatory factors which, activating the
human cell, change the number and type of receptors available on the human cells
(Cundell, D. et al. (1995) Nature, 377:435-438). Presented with an opportunity in this
new setting, pneumococci appear to take advantage and engage one of these upregulated
receptors, the platelet activating factor (PAF) receptor (Cundell et al. (1995) Nature
377:435-438). Within minutes of the appearance of the PAF receptor, pneumococci
undergo waves of enhanced adherence and invasion. Inhibition of bacterial binding to
activated cells, for instance by soluble receptor analogs, blocks the progression to disease
in animal models (Idanpaan-Heikkila, I. et al. (1997) J. Infect. Dis. 176:704-712).
Particularly effective in this regard are soluble carbohydrates containing
lacto-N-neotetraose with or without an additional sialic acid which prevent pneumococcal
attachment to human cells in vitro and prevent colonization in the lung in vivo.


Choline binding proteins: candidate structural adhesin gene: 

Pneumococci produce a family of surface proteins capable of binding to the bacterial
surface by non-covalent association to the cell wall teichoic acid or lipoteichoic acid.
The surface of Streptococcus pneumoniae is decorated with a family of CBPs (Choline
Binding Proteins) that are non-covalently bound to the phosphorylcholine. CbpA is a
75 kD surface-exposed choline binding protein that shows a chimeric architecture. There
is a unique N-terminal domain, a proline rich region followed by a C-terminal domain
comprised of 10 repeated regions responsible for binding to choline.

CbpA is an adhesin (ligand) for the glycoconjugate containing receptors present on the
surface of eukaryotic cells. Mutants with defects in cbpA showed reduced virulence in
the infant rat model for nasopharyngeal colonization. This binding is directed to choline
determinants which decorate the teichoic acid and is mediated by a signature choline
binding domain in each of the members of this family of proteins. The choline binding
domain was discovered and fully characterized by Lopez et al. in his studies of the
autolytic enzyme (Ronda et al. (1987) Eur. J. Biochem. 164:621-624). Other proteins
containing this domain include the autolysin of the pneumococcal phage and the
protective antigen, pneumococcal surface protein A (PspA) (Ronda, C. et al. (1987) Eur.
J. Biochem. 164:621-624 and McDaniel, L.S., et al. (1992) Microb. Pathog. 13:261-269).

A mutant with a defect in CbpA fails to colonize the nasopharynx. The choline binding
domain is shared with the other family members (C terminus), but the activity of binding
to human cells arises from the unique N-terminal domain. Since the process of
colonization and the progression to disease depend on pneumococcal attachment to human
cells as a primary step, interruption of the function of the N-terminal domain, either by
cross reactive antibody or by competitive inhibition with a peptide mimicking this
domain, may be critical to blocking disease.
Choline binding proteins for anti-pneumococcal vaccines are discussed in
International Application No. PCT/US97/07198 and such PCT Application is
incorporated in its entirety by reference. Vaccines against S. pneumoniae employ
purified carbohydrates of the capsules of the 23 most common serotypes of this
bacterium, but such vaccine is only 50% protective (Shapiro et al. NEJM 325:1453,
1991) and is not immunogenic under the age of 2. Further, a therapeutic polypeptide
would offer a therapeutic option in cases of infection with multi-resistant organisms.
Therefore, the invention herein fills a long felt need by providing a protective vaccine.
SUMMARY OF THE INVENTION


The present invention provides an isolated polypeptide comprising an amino acid
sequence of a N-terminal choline binding protein A truncate. The polypeptide comprises
the amino acid sequence as set forth in SEQ ID NOS 1, 3-7, or 9-11, including
fragments, mutants, variants, analogs, or derivatives, thereof. Also, this invention
provides an isolated polypeptide comprising an amino acid sequence of a N-terminal
choline binding protein A truncate having the amino acid as set forth in SEQ ID NO 24,
wherein the polypeptide exhibits its tertiary structure, and methods of preparation of such
a polypeptide. The isolated polypeptides are suitable for use in immunizing animals and
humans against bacterial infection, preferably pneumococci.

In a still further aspect, the present invention extends to an N-terminal choline binding
protein A truncate having lectin activity and no choline binding activity. Still further, this
invention provides an immunogenic N-terminal choline binding protein A truncate or a
fragment thereof.




The present invention also relates to isolated nucleic acids, such as recombinant DNA
molecules or cloned genes, or degenerate variants thereof, mutants, analogs, or fragments
thereof, which encode the isolated polypeptide or which competitively inhibit the activity
of the polypeptide. Preferably, the isolated nucleic acid, which includes degenerates,
variants, mutants, analogs, or fragments thereof, has a sequence as set forth in SEQ ID
NOS: 12, 14-17, 19-22 or 23. In a further embodiment of the invention, the full DNA
sequence of the recombinant DNA molecule or cloned gene so determined may be
operatively linked to an expression control sequence which may be introduced into an
appropriate host. The invention accordingly extends to unicellular hosts transformed
with the cloned gene or recombinant DNA molecule comprising a DNA sequence
encoding the present invention, and more particularly, the DNA sequences or fragments
thereof determined from the sequences set forth above.


Antibodies against the isolated polypeptide include naturally raised and recombinantly
prepared antibodies. These may include both polyclonal and monoclonal antibodies
prepared by known genetic techniques, as well as bi-specific (chimeric) antibodies, and
antibodies including other functionalities suiting them for diagnostic use conjunctive with
their capability of modulating bacterial adherence including but not limited to acting as
competitive agents.




It is still a further object of the present invention to provide a method for the treatment
of mammals to control the amount or activity of the bacteria or its subunits, so as to treat
or avert the adverse consequences of invasive, spontaneous, or idiopathic pathological
states. This invention provides pharmaceutical compositions for use in therapeutic
methods which comprise or are based upon the isolated polypeptides, their subunits or
their binding partners.


Lastly, this invention provides pharmaceutical compositions, vaccines, and diagnostic
and therapeutic methods of use thereof.




BRIEF DESCRIPTION OF THE DRAWINGS


FIGURE 1. Schematic representation of choline binding protein A (CbpA) and
recombinant truncates R1 (from about amino acid 16 to amino acid 321
from the N-terminus of CbpA as set forth in Figure 2) and R2 (from
about amino acid 16 to amino acid 444 from the N-terminus of CbpA as
set forth in Figure 2). Domain A is from about amino acid 153 to amino
acid 321 from the N-terminus of CbpA amino acid sequence as set forth
in Figure 2; domain B is from about amino acid 270 to amino acid 326
from the N-terminus of CbpA amino acid sequence as set forth in Figure
2; and domain C is from about amino acid 327 to amino acid 433 from the
N-terminus of CbpA amino acid sequence as set forth in Figure 2.
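
The coordinates in the Figure 1 caption can be captured as a small data structure, which makes the nesting of the truncates and domains easy to check. The following is a minimal sketch only: the `Region` class and its `contains` method are hypothetical names introduced for illustration, and the numbers are the approximate Figure 2 positions quoted in the caption as written (the definition of repeat region A given later in the description ends at position 269 rather than 321).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """A stretch of CbpA in Figure 2 numbering (hypothetical helper type)."""
    name: str
    start: int  # first position, inclusive
    end: int    # last position, inclusive

    def contains(self, other: "Region") -> bool:
        """True when `other` lies entirely inside this region."""
        return self.start <= other.start and other.end <= self.end


# Approximate coordinates taken from the Figure 1 caption.
R1 = Region("R1", 16, 321)
R2 = Region("R2", 16, 444)
DOMAIN_A = Region("A", 153, 321)
DOMAIN_B = Region("B", 270, 326)
DOMAIN_C = Region("C", 327, 433)

# R2 spans all three domains, whereas R1 stops before domain C begins.
in_r2 = [d.name for d in (DOMAIN_A, DOMAIN_B, DOMAIN_C) if R2.contains(d)]
in_r1 = [d.name for d in (DOMAIN_A, DOMAIN_B, DOMAIN_C) if R1.contains(d)]
```

Under these caption values, R2 contains domains A, B and C, while R1 contains only domain A.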


FIGURES 2A-B 

Comparison of homologies of various serotypes of the nucleic acid and 
amino acid sequence of the N-terminal region of CbpA. 

FIGURE 3. Expression and purification of recombinant R1 and R2. 


FIGURE 4. Results of passive protection in mice. Immune sera against recombinant 
R2 protected mice from lethal S. pneumoniae challenge. 

FIGURE 5. Titration of anti-R2 antibody on R6x adhering to LNnT-HSA coated
plates.


FIGURE 6. Titration of anti-CbpA and absorbed anti-CbpA antibodies for activity
blocking pneumococcal adherence to LNnT-HSA coated plates.


FIGURE 7. Results of active protection in mice. Immune sera against recombinant R1
protected mice from lethal S. pneumoniae challenge (challenge 560 cfu
serotype 6B).


DETAILED DESCRIPTION 

The present invention is directed to an isolated polypeptide comprising an amino acid
sequence of a N-terminal choline binding protein A truncate. The polypeptides are
suitable for use in immunizing animals against pneumococcal infection. These
polypeptides or peptide fragments thereof, when formulated with an appropriate adjuvant,
are used in vaccines for protection against pneumococci, and against other bacteria with
cross-reactive proteins.

This invention provides an isolated polypeptide comprising an amino acid sequence of
a N-terminal choline binding protein A truncate. In one embodiment the polypeptide has
the amino acid sequence as set forth in any of SEQ ID NO 1, 3-5, 7, or 9-11, including
fragments, mutants, variants, analogs, or derivatives, thereof. In another embodiment
the polypeptide has the amino acid KXXE (SEQ ID NO 6).

This invention provides an isolated polypeptide comprising an amino acid sequence of
a N-terminal choline binding protein A truncate as set forth in Figure 2. In one
embodiment, the polypeptide has an amino acid sequence which is a conserved region
as set forth in Figure 2. For example, conserved regions include but are not limited to
amino acid sequence 158 to 210; 158 to 172; 300 to 321; 331 to 339; 355 to 365; 367 to
374; 379 to 389; 409 to 427; and 430 to 447. Figure 2 sets forth homologies of various
serotypes of the nucleic acid and amino acid sequence of the N-terminal region of CbpA
which are contemplated by this invention.

Further, this invention provides an isolated polypeptide comprising an amino acid
sequence of a N-terminal choline binding protein A truncate having the amino acid as
set forth in SEQ ID NO 24, wherein the polypeptide exhibits its tertiary structure. In one
embodiment the polypeptide is an analog, fragment, mutant, or variant thereof. Variants
contemplated are set forth in Figure 2. This invention also provides an isolated
polypeptide comprising an amino acid sequence of a N-terminal choline binding protein
A truncate having the amino acid from about position 16 to about position 475 of
serotype 4 as set forth in Figure 2 or a corresponding amino acid of serotype 4 as shown
in Figure 2, wherein the polypeptide exhibits its tertiary structure. In one embodiment
tertiary structure corresponds to that present in the native protein.
Methods of preparation of the polypeptide are for example as follows: cleaving a full
length choline binding protein A with hydroxylamine, wherein the hydroxylamine
cleaves the choline binding protein A at amino acid Asparagine (N) at position 475 of
serotype R6x and serotype 4, or the corresponding amino acid of serotype R6x or
serotype 4 in a different serotype as shown in Figure 2, thereby creating the N-terminal
choline binding protein A truncate. Alternative methods which create a truncated choline
binding protein A or fragment thereof, and retain the native tertiary structure (i.e. that of
the full length choline binding protein A), are contemplated and known to those skilled
in the art. Because the polypeptide retains its tertiary structure, the isolated polypeptide
is suitable for use as an immunogen in immunizing animals and humans against bacterial
infection, preferably pneumococci.
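
At the sequence level, the truncation described above amounts to keeping every residue up to and including the asparagine at the cleavage site. The sketch below illustrates this on a short excerpt of the serotype 4 sequence that spans the site; `n_terminal_truncate` is a hypothetical helper, and the index it takes is local to the excerpt, not the Figure 2 position numbering, which cannot be reproduced from the text alone.

```python
def n_terminal_truncate(full_length: str, last_index: int) -> str:
    """Return the residues from the N-terminus through `last_index`.

    `last_index` is a 0-based index into `full_length` and is inclusive.
    """
    if not 0 <= last_index < len(full_length):
        raise ValueError("last_index falls outside the sequence")
    return full_length[: last_index + 1]


# Excerpt of serotype 4 CbpA (SEQ ID NO 2) around the end of SEQ ID NO 24.
excerpt = "APKAEKPAPAPKPENPAEQPKAEK"

# Locate the asparagine (N) that ends the truncate; this index is
# excerpt-local and is an assumption made purely for illustration.
asn_index = excerpt.index("KPENP") + 3

truncate = n_terminal_truncate(excerpt, asn_index)
```

The resulting fragment ends in `KPEN`, matching the C-terminal residues listed for SEQ ID NO 24.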

The polypeptide comprising the amino acid sequence of choline binding protein A 
(CbpA) serotype type 4 is as follows: 

ENEGATQVPTSSNRANESQAEQGEQPKKLDSERDKARKEVEEYVKKIVGESY
AKSTKKRHTITVALVNELNNIKNEYLNKIVESTSESQLQILMMESRSKVDEAV
SKFEKDSSSSSSSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKD
QKEEDRRNYPTITYKTLELEIAESDVEVKKAELELVKVKANEPRDEQKIKQAE
AEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGEL
ATPDKKENDAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKE
EDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKEPRNEEKVKQAKAE
VESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPKAE
KPAPAPKPEN (SEQ ID NO 24).


WO 99/51187
PCT/US99/07668

"Polypeptide R2" means a polypeptide comprising the amino acid sequences from
position 16 to position 444 of the N-terminal truncate of choline binding protein A
(CbpA) serotype type 4 (see Figure 1) which has the following sequence:

ENEGATQVPTSSNRANESQAEQGEQPKKLDSERDKARKEVEEYVKKIVGESY
AKSTKKRHTITVALVNELNNIKNEYLNKIVESTSESQLQILMMESRSKVDEAV
SKFEKDSSSSSSSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKD
QKEEDRRNYPTITYKTLELEIAESDVEVKKAELELVKVKANEPRDEQKIKQAE
AEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGEL
ATPDKKENDAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKE
EDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKEPRNEEKVKQAKAE
VESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPA (SEQ ID NO 1).

The DNA sequence which encodes polypeptide R2 of the N-terminal truncate of choline 
binding protein A (CbpA) serotype type 4: 

GAGAACGAGGGAGCTACCCAAGTACCCACTTCTTCTAATAGGGCAAATGA
AAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGA
GATAAGGCAAGGAAAGAGGTCGAGGAATATGTAAAAAAAATAGTGGGTG
AGAGCTATGCAAAATCAACTAAAAAGCGACATACAATTACTGTAGCTCTA
GTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTTGA
ATCAACCTCAGAAAGCCAACTACAGATACTGATGATGGAGAGTCGATCAA
AAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCAA
GTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAAC
AAGCCGACAGAACCAGGAGAAAAGGTAGCAGAAGCTAAGAAGAAGGTTG
AAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCGTCGTAA
CTACCCAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCG
ATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAGTAAAAGTGAAAGCTAA
CGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAG
AGTAAACAAGCTGAGGCTACAAGGTTAAAAAAAATCAAGACAGATCGTG
AAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTAAAGAGCAAG
GTAAACGAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAAC
ACCTGATAAAAAAGAAAATGATGCGAAGTCTTCAGATTCTAGCGTAGGTG
AAGAAACTCTTCCAAGCCCATCCCTGAAACCAGAAAAAAAGGTAGCAGA
AGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAA
GAAGAAGATCGCCGTAACTACCCAACCAATACTTACAAAACGCTTGAACT
TGAAATTGCTGAGTCCGATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAG
TAAAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGC
AAAAGCGGAAGTTGAGAGTAAAAAAGCTGAGGCTACAAGGTTAGAAAAA
ATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAGCAG
CAGAAGAAGATAAAGTTAAAGAAAAACCAGCTG (SEQ ID NO 12).
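
One way to cross-check a transcribed DNA listing against its polypeptide is to translate it with the standard genetic code and compare the result with the amino acid sequence. The sketch below does this for the first 50 nucleotides of SEQ ID NO 12; the `translate` function and the table construction are illustrative helpers, and the use of the standard genetic code is an assumption, since the description does not name a translation table.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing
    bases that do not fill a complete codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of SEQ ID NO 12 as listed above (50 nucleotides).
first_line = "GAGAACGAGGGAGCTACCCAAGTACCCACTTCTTCTAATAGGGCAAATGA"
peptide = translate(first_line)
```

The translated residues, `ENEGATQVPTSSNRAN`, agree with the opening residues of polypeptide R2 (SEQ ID NO 1).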

Amino acid sequence of CbpA of serotype 4: 

ENEGATQVPTSSNRANESQAEQGEQPKKLDSERDKARKEVEEYVKKIVGESY
AKSTKKRHTITVALVNELNNIKNEYLNKIVESTSESQLQILMMESRSKVDEAV
SKFEKDSSSSSSSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKD
QKEEDRRNYPTITYKTLELEIAESDVEVKKAELELVKVKANEPRDEQKIKQAE
AEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGEL
ATPDKKENDAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKE
EDRRNYPTNTYKTLELELAESDVEVKKAELELVKEEAKEPRNEEKVKQAKAE
VESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPKAE
KPAPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPA
QPSTPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATG
WLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANGSMATGWLQYNGS
WYYLNANGSMATGWLQYNGSWYYLNANGDMATGWVKDGDTWYYLEAS
GAMKASQWFKVSDKWYYVNGSGALAVNTTVDGYGVNANGEWVN (SEQ ID
NO 2).

DNA sequence encoding the amino acid sequence of the CbpA of serotype 4:

GAGAACGAGGGAGCTACCCAAGTACCCACTTCTTCTAATAGGGCAAATGA
AAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGA
GATAAGGCAAGGAAAGAGGTCGAGGAATATGTAAAAAAAATAGTGGGTG
AGAGCTATGCAAAATCAACTAAAAAGCGACATACAATTACTGTAGCTCTA
GTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTTGA
ATCAACCTCAGAAAGCCAACTACAGATACTGATGATGGAGAGTCGATCAA
AAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCAA
GTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAAC
AAGCCGACAGAACCAGGAGAAAAGGTAGCAGAAGCTAAGAAGAAGGTTG
AAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCGTCGTAA
CTACCCAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCG
ATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAGTAAAAGTGAAAGCTAA
CGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAG
AGTAAACAAGCTGAGGCTACAAGGTTAAAAAAAATCAAGACAGATCGTG
AAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTAAAGAGCAAG
GTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAAC
ACCTGATAAAAAAGAAAATGATGCGAAGTCTTCAGATTCTAGCGTAGGTG
AAGAAACTCTTCCAAGCCCATCCCTGAAACCAGAAAAAAAGGTAGCAGA
AGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAA
GAAGAAGATCGCCGTAACTACCCAACCAATACTTACAAAACGCTTGAACT
TGAAATTGCTGAGTCCGATGTGGAAGTTAAAAAAGCGGAGgCTTGAACTA
GTAAAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAG
CAAAAGCGGAAGTTGAGAGTAAAAAAGCTGAGGCTACAAGGTTAGAAAA
AATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAGCA
GCAGAAGAAGATAAAGTTAAAGAAAAACCAGCTGAACAACCACAACCAG
CGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCTCCAAAACCAGAGAA
TCCAGCTGAACAACCAAAAGCAGAAAAACCAGCTGATCAACAAGCTGAA
GAAGACTATGCTCGTAGATCAGAAGAAGAATATAATCGCTTGACTCAACA
GCAACCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCCAAAAACA
GGCTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTC
AATGGCGACAGGATGGCTCCAAAACAAtGGCTCAtGGTAcTACcTCAACAG
CAATGGCGCTATGGCGACAGGATGGCTCCAAAACAATGGTTCATGGTACT
ATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGT
TCATGGTACTACCTAAACGCTAATGGTTCAATGGCGACAGGATGGCTCCA
ATACAATGGCTCATGGTACTACCTAAACGCTAATGGTTCAATGGCGACAG
GATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGAT
ATGGCGACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGC
ATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGT
ACTATGTCAATGGCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGC
TATGGAGTCAATGCCAATGGTGAATGGGTAAACTAA (SEQ ID NO 13).
"Polypeptide R1" means a polypeptide comprising the amino acid sequences from
position 16 to position 321 of the N-terminal truncate of choline binding protein A (CbpA)
serotype type 4 which has the following sequence:

ENEGATQVPTSSNRANESQAEQGEQPKKLDSERDKARKEVEEYVKKIVGESY
AKSTKKRHTITVALVNELNNIKNEYLNKIVESTSESQLQILMMESRSKVDEAV
SKFEKDSSSSSSSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKD
QKEEDRRNYPTITYKTLELEIAESDVEVKKAELELVKVKANEPRDEQKIKQAE
AEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGEL
ATPDKKENDAKSSDSSVGEETL (SEQ ID NO 3).

The DNA sequence which encodes polypeptide R1 is:

GAGAACGAGGGAGCTACCCAAGTACCCACTTCTTCTAATAGGGCAAATGA
AAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGA
GATAAGGCAAGGAAAGAGGTCGAGGAATATGTAAAAAAAATAGTGGGTG
AGAGCTATGCAAAATCAACTAAAAAGCGACATACAATTACTGTAGCTCTA
GTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTTGA
ATCAACCTCAGAAAGCCAACTACAGATACTGATGATGGAGAGTCGATCAA
AAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCAA
GTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAAC
AAGCCGACAGAACCAGGAGAAAAGGTAGCAGAAGCTAAGAAGAAGGTTG
AAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCGTCGTAA
CTACCCAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCG
ATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAGTAAAAGTGAAAGCTAA
CGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAG
AGTAAACAAGCTGAGGCTACAAGGTTAAAAAAAATCAAGACAGATCGTG
AAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTAAAGAGCAAG
GTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAAC
ACCTGATAAAAAAGAAAATGATGCGAAGTCTTCAGATTCTAGCGTAGGTG
AAGAAACTCTTC (SEQ ID NO 14).



“Polypeptide C/R2" means a polypeptide comprising a repeat region C within R2, 
wherein the repeat region C has the amino acid sequences from position 327 to position 
433 of the N-terminal choline binding protein A (CbpA) serotype type 4 which has the 
following sequence: 

KPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELELAESDVEVK
KAELELVKEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAK
RKA (SEQ ID NO 4).


The DNA sequence which encodes polypeptide C/R2 is:

AAACCAGAAAAAAAGGTAGCAGAAGCTGAGAAGAAGGTTGAAGAAGCTA
AGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCGCCGTAACTACCCAAC
CAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAG
TTAAAAAAGCGGAGCTTGAACTAGTAAAAGAGGAAGCTAAGGAACCTCG
AAACGAGGAAAAAGTTAAGCAAGCAAAAGCGGAAGTTGAGAGTAAAAAA
GCTGAGGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAAAGCAG
AAGAAGAAGCTAAACGAAAAGCA (SEQ ID NO 15).

"Polypeptide A/R2" means a polypeptide comprising a repeat region A within R2,
wherein the repeat region A has the amino acid sequences from position 153 to position
269 of the N-terminal of choline binding protein A (CbpA) serotype type 4 which has
the following sequence:

TEPGEKVAEAKKKVEEAEKKAKDQKEEDRRNYPTITYKTLELEIAESDVEVK
KAELELVKVKANEPRDEQKIKQAEAEVESKQAEATRLKKIKTDREEAEEEAK
RRADA (SEQ ID NO 5). As shown in Figure 1, region A of polypeptide R2 is the same
region A as within R1.

The DNA sequence which encodes the polypeptide A/R2 is: 

ACAGAACCAGGAGAAAAGGTAGCAGAAGCTAAGAAGAAGGTTGAAGAA
GCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCGTCGTAACTACC
CAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTG
GAAGTTAAAAAAGCGGAGCTTGAACTAGTAAAAGTGAAAGCTAACGAAC
CTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAA
ACAAGCTGAGGCTACAAGGTTAAAAAAAATCAAGACAGATCGTGAAGAA
GCAGAAGAAGAAGCTAAACGAAGAGCAGATGCT (SEQ ID NO 16).

The identity or location of one or more amino acid residues may be changed or modified
to include variants such as, for example, deletions containing less than all of the
residues specified for the protein, substitutions wherein one or more residues specified
are replaced by other residues and additions wherein one or more amino acid residues are
added to a terminal or medial portion of the polypeptide (see Figure 2). These molecules
include: the incorporation of codons "preferred" for expression by selected
non-mammalian hosts; the provision of sites for cleavage by restriction endonuclease
enzymes; and the provision of additional initial, terminal or intermediate DNA sequences
that facilitate construction of readily expressed vectors. Specifically, examples of the
amino acid substitutions of serotype 4, including but not limited to, are as follows: E at
position 154 is substituted with K; P at position 155 is substituted with L; G at position
156 is substituted with E; E at position 157 is substituted with K; K at position 181 is
substituted with E; D at position 182 is substituted with A; R at position 187 is
substituted with Y, H, or L; I at position 194 is substituted with N; E at position 200 is
substituted with D; E at position 202 is substituted with D; E at position 209 is
substituted with K; K at position 212 is substituted with E; V at position 218 is
substituted with L; V at position 220 is substituted with K or E; K at position 221 is
substituted with E; N at position 223 is substituted with D or K; P at position 225 is
substituted with S, T, or R; D at position 227 is substituted with N; E at position 228 is
substituted with K; Q at position 229 is substituted with E, G, or D; K at position 230 is
substituted with T; K at position 232 is substituted with N; E at position 235 is
substituted with K; A at position 236 is substituted with E; E at position 237 is
substituted with K; S at position 240 is substituted with N; K at position 241 is
substituted with E; Q at position 242 is substituted with K; K at position 249 is
substituted with E; K at position 250 is substituted with N; E at position 257 is
substituted with Q or K; A at position 263 is substituted with L; K at position 264 is
substituted with E; R at position 265 is substituted with N; R at position 266 is
substituted with I; A at position 267 is substituted with K or V; D at position 268 is
substituted with T; A at position 269 is substituted with D; A at position 291 is
substituted with T, V, P, G, or X; G at position 294 is substituted with G, A, or E; V at
position 295 is substituted with D or A; P at position 295 is substituted with L or F; L
at position 299 is substituted with P or Q; P at position 328 is substituted with S; E at
position 329 is substituted with G; E at position 340 is substituted with A; K at position
343 is substituted with E or D; E at position 347 is substituted with K; D at position 349
is substituted with A; R at position 354 is substituted with H; E at position 366 is
substituted with D; E at position 375 is substituted with K; K at position 378 is
substituted with E; E at position 390 is substituted with G; P at position 391 is substituted
with S; N at position 393 is substituted with D; V at position 397 is substituted with I;
and K at position 408 is substituted with Q.
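
The substitutions listed above follow a regular phrasing, so they can be turned into structured records for comparison against an alignment. The following is a minimal sketch under that assumption: `parse_substitutions` and the regular expression are hypothetical helpers, the sample text quotes three entries from the list, and the positions remain in the Figure 2 numbering used by the description.

```python
import re

# Matches phrases such as "R at position 187 is substituted with Y, H, or L".
SUBSTITUTION = re.compile(
    r"\b([A-Z]) at position (\d+) is substituted with ([^;.]+)"
)


def parse_substitutions(text: str) -> list[tuple[str, int, list[str]]]:
    """Return (original residue, position, replacement residues) records."""
    records = []
    for match in SUBSTITUTION.finditer(text):
        original = match.group(1)
        position = int(match.group(2))
        # Replacement residues are the standalone capital letters in the tail.
        replacements = re.findall(r"\b[A-Z]\b", match.group(3))
        records.append((original, position, replacements))
    return records


sample = (
    "E at position 154 is substituted with K; "
    "R at position 187 is substituted with Y, H, or L; "
    "V at position 220 is substituted with K or E."
)
parsed = parse_substitutions(sample)
```

Each record keeps the original residue alongside the position, so a later step could confirm that the residue actually occupies that position in a given serotype before applying a change.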

"Polypeptide R2 serotype R6x" means a polypeptide comprising the amino acid
sequences from position 16 to position 444 of the N-terminal truncate of choline
binding protein A (CbpA) serotype R6x which has the following sequence:

ENEGSTQAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRKHTQNVAL
NIKLSAIKTKYLRELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKPGEKVA
EAKKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAEFDVKVKEAELELVK
EEAKESRNEGTIKQAKEKVESKKAEATRLENIKTDRKKAEEEAKRKADAKLK
EANVATSDQGKPKGRAKRGVPGELATPDKKENDAKSSDSSVGEETLPSSSLK
SGKKVAEAEKKVEEAEKKAKDQKEEDRRNYPTNTYKTLDLEIAESDVKVKE
AELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLENIKTDRKKAEEEAKR
KAAEEDKVKEKPA (SEQ ID NO 7).

The DNA sequence which encodes polypeptide R2 serotype R6x: 

GAAAACGAAGGAAGTACCCAAGCAGCCACTTCTTCTAATATGGCAAAGAC
AGAACATAGGAAAGCTGCTAAACAAGTCGTCGATGAATATATAGAAAAA
ATGTTGAGGGAGATTCAACTAGATAGAAGAAAACATACCCAAAATGTCGC
CTTAAACATAAAGTTGAGCGCAATTAAAACGAAGTATTTGCGTGAATTAA
ATGTTTTAGAAGAGAAGTCGAAAGATGAGTTGCCGTCAGAAATAAAAGCA
AAGTTAGACGCAGCTTTTGAGAAGTTTAAAAAAGATACATTGAAACCAGG
AGAAAAGGTAGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTAAGAAAAAA
GCCGAGGATCAAAAAGAAGAAGATCGTCGTAACTACCCAACCAATACTTA
CAAAACGCTTGAACTTGAAATTGCTGAGTTCGATGTGAAAGTTAAAGAAG
CGGAGCTTGAACTAGTAAAAGAGGAAGCTAAAGAAtCTCGAAACGAGGGC
ACAATTAAGCAAGCAAAAGAGAAAGTTGAGAGTAAAAAAGCTGAGGCTA
CAAGGTTAGAAAACAtCAAGACAGAtCGTAAAAAAGCAGAAGAAGAAGCT
AAACGAAAAGCAGATGCTAAGTTGAAGGAAGCTAATGTAGCGACTTCAG
AtCAAGGTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTA
GCAACACCTGATAAAAAAGAAAATGATGCGAAGTCTTCAGATTCTAGCGT
AGGTGAAGAAACTCTTCCAAGCTCATCCCTGAAATCAGGAAAAAAGGTAG
CAGAAGCTGAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCCAAGGATCA
AAAAGAAGAAGATCGCCGTAACTACCCAACCAATACTTACAAAACGCTTG
ACCTTGAAATTGCTGAGTCCGATGTGAAAGTTAAAGAAGCGGAGCTTGAA
CTAGTAAAAGAGGAAGCTAAGGAACCTCGAGACGAGGAAAAAATTAAGC
AAGCAAAAGCGAAAGTTGAGAGTAAAAAAGCTGAGGCTACAAGGTTAGA
AAACATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAA
GCAGCAGAAGAAGATAAAGTTAAAGAAAAACCAGCTG (SEQ ID NO 17).

Amino acid sequence of CbpA of serotype R6x:

ENEGSTQAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRKHTQNVAL
NIKLSAIKTKYLRELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKPGEKVA
EAKKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAEFDVKVKEAELELVK
EEAKESRNEGTIKQAKEKVESKKAEATRLENIKTDRKKAEEEAKRKADAKLK
EANVATSDQGKPKGRAKRGVPGELATPDKKENDAKSSDSSVGEETLPSSSLK
SGKKVAEAEKKVEEAEKKAKDQKEEDRRNYPTNTYKTLDLEIAESDVKVKE
AELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLENIKTDRKKAEEEAKR
KAAEEDKVKEKPAEQPQPAPATQPEKPAPKPEKPAEQPKAEKTDDQQAEEDY
ARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGMWYFYNTDGSMAT
GWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNANGSMATGWLQNNG
SWYYLNANGAMATGWLQYNGSWYYLNSNGAMATGWLQYNGSWYYLNA
NGDMATGWLQNNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATG
WVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGSGALAVNTTVDGYG
VNANGEWVN (SEQ ID NO 8).

DNA sequence encoding the amino acid sequence of the CbpA of serotype R6x:

GAAAACGAAGGAAGTACCCAAGCAGCCACTTCTTCTAATATGGCAAAGAC
AGAACATAGGAAAGCTGCTAAACAAGTCGTCGATGAATATATAGAAAAA
ATGTTGAGGGAGATTCAACTAGATAGAAGAAAACATACCCAAAATGTCGC
CTTAAACATAAAGTTGAGCGCAATTAAAACGAAGTATTTGCGTGAATTAA
ATGTTTTAGAAGAGAAGTCGAAAGATGAGTTGCCGTCAGAAATAAAAGCA
AAGTTAGACGCAGCTTTTGAGAAGTTTAAAAAAGATACATTGAAACCAGG
AGAAAAGGTAGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTAAGAAAAAA
GCCGAGGATCAAAAAGAAGAAGATCGTCGTAACTACCCAACCAATACTTA
CAAAACGCTTGAACTTGAAATTGCTGAGTTCGATGTGAAAGTTAAAGAAG
CGGAGCTTGAACTAGTAAAAGAGGAAGCTAAAGAAtCTCGAAACGAGGGC
ACAATTAAGCAAGCAAAAGAGAAAGTTGAGAGTAAAAAAGCTGAGGCTA
CAAGGTTAGAAAACAtCAAGACAGAtCGTAAAAAAGCAGAAGAAGAAGCT
AAACGAAAAGCAGATGCTAAGTTGAAGGAAGCTAATGTAGCGACTtCAGA
tCAAGGTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAG
CAACACCTGATAAAAAAGAAAATGATGCGAAGTCTTCAGATTCTAGCGTA
GGTGAAGAAACTCTTCCAAGCTCATCCCTGAAATCAGGAAAAAAGGTAGC
AGAAGCTGAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCCAAGGATCAA
AAAGAAGAAGATCGCCGTAACTACCCAACCAATACTTACAAAACGCTTGA
CCTTGAAATTGCTGAGTCCGATGTGAAAGTTAAAGAAGCGGAGCTTGAAC
TAGTAAAAGAGGAAGCTAAGGAACCTCGAGACGAGGAAAAAATTAAGCA
AGCAAAAGCGAAAGTTGAGAGTAAAAAAGCTGAGGCTACAAGGTTAGAA
AACATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAG
CAGCAGAAGAAGATAAAGTTAAAGAAAAACCAGCTGAACAACCACAACC
AGCGCCGGCTACTCAACCAGAAAAACCAGCTCCAAAACCAGAGAAGCCA
GCTGAACAACCAAAAGCAGAAAAAACAGATGATCAACAAGCTGAAGAAG
ACTATGCTCGTAGATCAGAAGAAGAATATAATCGCTTGACTCAACAGCAA
CCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCCAAAAACAGGCT
GGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATG
GCAACAGGATGGCTCCAAAACAACGGTTCATGGTACTATCTAAACGCTAA
TGGTGCTATGGCGACAGGATGGCTCCAAAACAATGGTTCATGGTACTATC
TAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCA
TGGTACTACCTAAACGCTAATGGTGCTATGGCGACAGGATGGCTCCAATA
CAATGGTTCATGGTACTACCTAAACAGCAATGGCGCTATGGCGACAGGAT
GGCTCCAATACAATGGCTCATGGTACTACCTCAACGCTAATGGTGATATG
GCGACAGGATGGCTCCAAAACAACGGTTCATGGTACTACCTCAACGCTAA
TGGTGATATGGCGACAGGATGGCTCCAATACAACGGTTCATGGTATTACC
TCAACGCTAATGGTGATATGGCGACAGGTTGGGTGAAAGATGGAGATACC
TGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAA
AGTATCAGATAAATGGTACTATGTCAATGGCTCAGGTGCCCTTGCAGTCA
ACACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC
TAA (SEQ ID NO 18).

"Polypeptide R1 serotype R6x" means a polypeptide comprising the amino acid
sequences from position 16 to position 321 of the N-terminal truncate of choline
binding protein A (CbpA) serotype R6x which has the following sequence:

ENEGSTQAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRKHTQNVAL
NIKLSAIKTKYLRELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKPGEKVA
EAKKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAEFDVKVKEAELELVK
EEAKESRNEGTIKQAKEKVESKKAEATRLENIKTDRKKAEEEAKRKADAKLK
EANVATSDQGKPKGRAKRGVPGELATPDKKENDAKSSDSSVGEETL (SEQ ID
NO 9).


The DNA sequence which encodes polypeptide R1 is: 

GAAAACGAAGGAAGTACCCAAGCAGCCACTTCTTCTAATATGGCAAAGAC
AGAACATAGGAAAGCTGCTAAACAAGTCGTCGATGAATATATAGAAAAA
ATGTTGAGGGAGATTCAACTAGATAGAAGAAAACATACCCAAAATGTCGC
CTTAAACATAAAGTTGAGCGCAATTAAAACGAAGTATTTGCGTGAATTAA
ATGTTTTAGAAGAGAAGTCGAAAGATGAGTTGCCGTCAGAAATAAAAGCA
AAGTTAGACGCAGCTTTTGAGAAGTTTAAAAAAGATACATTGAAACCAGG
AGAAAAGGTAGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTAAGAAAAAA 
GCCGAGGATCAAAAAGAAGAAGATCGTCGTAACTACCCAACCAATACTTA 
CAAAACGCTTGAACTTGAAATTGCTGAGTTCGATGTGAAAGTTAAAGAAG 
CGGAGCTTGAACTAGTAAAAGAGGAAGCTAAAGAATCTCGAAACGAGGG 
CACAATTAAGCAAGCAAAAGAGAAAGTTGAGAGTAAAAAAGCTGAGGCT
ACAAGGTTAGAAAACATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAG
CTAAACGAAAAGCAGATGCTAAGTTGAAGGAAGCTAATGTAGCGACTTCA 
GATCAAGGTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGC 
TAGCAACACCTGATAAAAAAGAAAATGATGCGAAGTCTTCAGATTCTAGC 
GTAGGTGAAGAAACTCTTC (SEQ ID NO 19). 
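
The relationship between the DNA sequence and the polypeptide it encodes can be checked with a short script. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative only and do not come from this specification. It translates the first 48 nucleotides of SEQ ID NO 19 as printed above, which should correspond to the first 16 residues of SEQ ID NO 9.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string (5' to 3') in reading frame 1.

    Whitespace is ignored and case is normalized. Trailing bases that do not
    fill a complete codon are dropped. Stop codons are reported as '*'.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First printed line of SEQ ID NO 19, trimmed to 48 nucleotides (16 codons).
seq19_start = "GAAAACGAAGGAAGTACCCAAGCAGCCACTTCTTCTAATATGGCAAAGAC"[:48]
print(translate(seq19_start))  # ENEGSTQAATSSNMAK
```

The same function can be applied to the other DNA sequences in this section once OCR spacing has been removed; it is a consistency check, not a substitute for the sequence listing.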


"Polypeptide C/R2 serotype R6x" means a polypeptide comprising a repeat region C
within R2 (see Figure 2), wherein the repeat region C has the amino acid sequence from
position 327 to position 433 of the N-terminal region of choline binding protein A (CbpA)
serotype R6x, which has the following sequence:

KSGKKVAEAEKKVEEAEKKAKDQKEEDRRNYPTNTYKTLDLEIAESDVKVK
EAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLENIKTDRKKAEEEAK
RKA (SEQ ID NO 10)


The DNA sequence of polypeptide C/R2 serotype R6x: 

AAATCAGGAAAAAAGGTAGCAGAAGCTGAGAAGAAGGTTGAAGAAGCTG
AGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCGCCGTAACTACCCAAC 
CAATACTTACAAAACGCTTGACCTTGAAATTGCTGAGTCCGATGTGAAAG
TTAAAGAAGCGGAGCTTGAACTAGTAAAAGAGGAAGCTAAGGAACCTCG 
AGACGAGGAAAAAATTAAGCAAGCAAAAGCGAAAGTTGAGAGTAAAAAA
GCTGAGGCTACAAGGTTAGAAAACATCAAGACAGATCGTAAAAAAGCAG 
AAGAAGAAGCTAAACGAAAAGCA (SEQ ID NO 20). 

"Polypeptide A/R2 serotype R6x" means a polypeptide comprising a repeat region A
within R2 (see Figure 2), wherein the repeat region A has the amino acid sequence from
position 155 to position 265 of the N-terminal region of choline binding protein A (CbpA)
serotype R6x, which has the following sequence:

PGEKVAEAKKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAEFDVKVKE
AELELVKEEAKESRNEGTIKQAKEKVESKKAEATRLENIKTDRKKAEEEAKR
KADA (SEQ ID NO 11)


The DNA sequence which encodes the polypeptide A/R2 serotype R6x is: 

CCAGGAGAAAAGGTAGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTAAGA
AAAAAGCCGAGGATCAAAAAGAAGAAGATCGTCGTAACTACCCAACCAA
TACTTACAAAACGCTTGAACTTGAAATTGCTGAGTTCGATGTGAAAGTTA
AAGAAGCGGAGCTTGAACTAGTAAAAGAGGAAGCTAAAGAATCTCGAAAC
GAGGGCACAATTAAGCAAGCAAAAGAGAAAGTTGAGAGTAAAAAAGCTG
AGGCTACAAGGTTAGAAAACATCAAGACAGATCGTAAAAAAGCAGAAGA
AGAAGCTAAACGAAAAGCAGATGCT (SEQ ID NO 21).

This invention is directed to an isolated polypeptide, wherein the isolated polypeptide
consists of the amino acid sequence as set forth in SEQ ID NO 22 or 23, including
fragments, mutants, variants, analogs, or derivatives thereof.

SPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDV
EVKXAELELVKEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEE
EAKRKAAEEDKVKEKPA (SEQ ID NO 22; serotype 4; position 323-434); or

PSSSLKSGKKVAEAEKKVEEAEKKAKDQKEEDRRNYPTNTYKTLDLEIAESD
VKVKEAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLENIKTDRKKAE
EEAKRKAAEEDKVKEKRA (SEQ ID NO 23; serotype R6x; position 322-434).

"Polypeptide B/R2" means a polypeptide comprising the amino acid sequence from
position 270 to position 326 of the N-terminal truncate of choline binding protein A
(CbpA) serotype 4 as set forth in Figure 2. "Polypeptide B/R2 serotype R6x"
means a polypeptide comprising the amino acid sequence from position 264 to
position 326 of the N-terminal truncate of choline binding protein A (CbpA) serotype
R6x as set forth in Figure 2. This invention contemplates a polypeptide having the amino
acid sequence of regions A, B, C, A+B, B+C, or A+C as shown in Figure 1.

Further, this invention provides an isolated polypeptide comprising an amino acid
sequence of an N-terminal choline binding protein A truncate, wherein the polypeptide
has the amino acid sequence KXXE (SEQ ID NO 6).
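
Occurrences of the KXXE motif in a candidate sequence can be located programmatically. The sketch below assumes that X stands for any single amino acid residue, which is the usual convention but is not stated explicitly here; the function name `find_kxxe` is hypothetical. A lookahead pattern is used so that overlapping occurrences are all reported, with 1-based positions.

```python
import re

# Assumption: "X" in KXXE means any single residue.
# The lookahead wrapper lets overlapping motifs be reported separately.
KXXE_PATTERN = re.compile(r"(?=(K..E))")


def find_kxxe(protein: str):
    """Return (1-based start position, matched residues) for each KXXE motif."""
    seq = "".join(protein.split()).upper()
    return [(m.start() + 1, m.group(1)) for m in KXXE_PATTERN.finditer(seq)]


# First 14 residues of SEQ ID NO 11 (polypeptide A/R2 serotype R6x).
print(find_kxxe("PGEKVAEAKKKVEE"))
# [(4, 'KVAE'), (10, 'KKVE'), (11, 'KVEE')]
```

Positions are relative to the fragment supplied, not to the numbering of full-length CbpA.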

This invention is directed to a polypeptide comprising an amino acid sequence of an
N-terminal choline binding protein A truncate, wherein the amino acid sequence is set forth
in Figure 2. In one embodiment, the polypeptide has an amino acid sequence which is a
conserved region as set forth in Figure 2. For example, conserved regions include but are
not limited to amino acid sequences 158 to 172; 300 to 321; 331 to 339; 355 to 365; 367
to 374; 379 to 389; 409 to 427; and 430 to 447. Figure 2 sets forth homologies of
various serotypes of the nucleic acid and amino acid sequence of the N-terminal region
of CbpA which are contemplated by this invention.

This invention provides an isolated polypeptide comprising an amino acid sequence of
an N-terminal choline binding protein A truncate, wherein the polypeptide has lectin
activity and does not bind to choline. In one embodiment the polypeptide has the amino
acid sequence as set forth in any of SEQ ID NOS 1, 3-5, 7, or 9-11, including fragments,
mutants, variants, analogs, or derivatives thereof.

As used herein, "a polypeptide having a lectin activity" means a polypeptide, peptide
or protein which binds noncovalently to a carbohydrate. As defined herein, "adhesin"
means noncovalent binding of a bacterium to a human cell or secretion that is stable
enough to withstand washing. As defined herein, "binds to the LNnT" means binds to
Lacto-N-neotetraose coated substrates more than to an albumin control.

This invention provides an isolated immunogenic polypeptide comprising an amino acid
sequence of an N-terminal choline binding protein A truncate. It is contemplated by this
invention that the immunogenic polypeptide has the amino acid sequence as set forth in
any of SEQ ID NOS 1, 3-7, or 9-11, including fragments, mutants, variants, analogs,
or derivatives thereof. This invention provides an isolated polypeptide comprising an
amino acid sequence of an N-terminal choline binding protein A truncate as set forth in
Figure 2. In one embodiment, the polypeptide has an amino acid sequence which is a
conserved region as set forth in Figure 2.

This invention is directed to analogs of the polypeptide which comprise the amino acid
sequence as set forth above. The analog polypeptide may have an N-terminal methionine
or an N-terminal polyhistidine optionally attached to the N or COOH terminus of the
polypeptide which comprises the amino acid sequence.

In another embodiment, this invention contemplates peptide fragments of the
polypeptide which result from proteolytic digestion of the polypeptide. In
another embodiment, the derivative of the polypeptide has one or more chemical moieties
attached thereto. In another embodiment the chemical moiety is a water soluble polymer.
In another embodiment the chemical moiety is polyethylene glycol. In another
embodiment the polypeptide is mono-, di-, tri- or tetrapegylated. In another
embodiment the polypeptide is N-terminal monopegylated.

Attachment of polyethylene glycol (PEG) to compounds is particularly useful because
PEG has very low toxicity in mammals (Carpenter et al., 1971). For example, a PEG
adduct of adenosine deaminase was approved in the United States for use in humans for
the treatment of severe combined immunodeficiency syndrome. A second advantage
afforded by the conjugation of PEG is that of effectively reducing the immunogenicity
and antigenicity of heterologous compounds. For example, a PEG adduct of a human
protein might be useful for the treatment of disease in other mammalian species without
the risk of triggering a severe immune response. The compound of the present invention
may be delivered in a microencapsulation device so as to reduce or prevent a host
immune response against the compound or against cells which may produce the
compound. The compound of the present invention may also be delivered
microencapsulated in a membrane, such as a liposome.

Numerous activated forms of PEG suitable for direct reaction with proteins have been
described. Useful PEG reagents for reaction with protein amino groups include active
esters of carboxylic acid or carbonate derivatives, particularly those in which the leaving
groups are N-hydroxysuccinimide, p-nitrophenol, imidazole or
1-hydroxy-2-nitrobenzene-4-sulfonate. PEG derivatives containing maleimido or haloacetyl
groups are useful reagents for the modification of protein free sulfhydryl groups. Likewise,
PEG reagents containing amino, hydrazine or hydrazide groups are useful for reaction with
aldehydes generated by periodate oxidation of carbohydrate groups in proteins.

In one embodiment, the amino acid residues of the polypeptide described herein are
preferred to be in the "L" isomeric form. In another embodiment, residues in the "D"
isomeric form can be substituted for any L-amino acid residue, as long as the desired
functional property of lectin activity is retained by the polypeptide. NH2 refers to the free
amino group present at the amino terminus of a polypeptide. COOH refers to the free
carboxy group present at the carboxy terminus of a polypeptide. Abbreviations used
herein are in keeping with standard polypeptide nomenclature, J. Biol. Chem., 243:3552-59
(1969).

It should be noted that all amino acid residue sequences are represented herein by
formulae whose left and right orientation is in the conventional direction of
amino-terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the
beginning or end of an amino acid residue sequence indicates a peptide bond to a further
sequence of one or more amino acid residues.

Synthetic polypeptides, prepared using the well known techniques of solid phase, liquid
phase, or peptide condensation, or any combination thereof, can include
natural and unnatural amino acids. Amino acids used for peptide synthesis may be
standard Boc (Nα-amino protected Nα-t-butyloxycarbonyl) amino acid resin with the
standard deprotecting, neutralization, coupling and wash protocols of the original solid
phase procedure of Merrifield (1963, J. Am. Chem. Soc. 85:2149-2154), or the
base-labile Nα-amino protected 9-fluorenylmethoxycarbonyl (Fmoc) amino acids first
described by Carpino and Han (1972, J. Org. Chem. 37:3403-3409). Thus, a polypeptide
of the invention may comprise D-amino acids, a combination of D- and L-amino acids,
and various "designer" amino acids (e.g., β-methyl amino acids, Cα-methyl amino acids,
and Nα-methyl amino acids, etc.) to convey special properties. Synthetic amino acids
include ornithine for lysine, fluorophenylalanine for phenylalanine, and norleucine for
leucine or isoleucine. Additionally, by assigning specific amino acids at specific
coupling steps, α-helices, β turns, β sheets, γ-turns, and cyclic peptides can be generated.


In one aspect of the invention, the peptides may comprise a special amino acid at the
C-terminus which incorporates either a CO2H or CONH2 side chain to simulate a free
glycine or a glycine-amide group. Another way to consider this special residue would
be as a D or L amino acid analog with a side chain consisting of the linker or bond to the
bead. In one embodiment, the pseudo-free C-terminal residue may be of the D or the L
optical configuration; in another embodiment, a racemic mixture of D and L-isomers may
be used.

In an additional embodiment, pyroglutamate may be included as the N-terminal residue
of the peptide. Although pyroglutamate is not amenable to sequencing by Edman
degradation, by limiting substitution to only 50% of the peptides on a given bead with
N-terminal pyroglutamate, there will remain enough non-pyroglutamate peptide on the
bead for sequencing. One of ordinary skill would readily recognize that this technique
could be used for sequencing of any peptide that incorporates a residue resistant to
Edman degradation at the N-terminus. Other methods to characterize individual peptides
that demonstrate desired activity are described in detail infra. Specific activity of a
peptide that comprises a blocked N-terminal group, e.g., pyroglutamate, when the
particular N-terminal group is present in 50% of the peptides, would readily be
demonstrated by comparing activity of a completely (100%) blocked peptide with a
non-blocked (0%) peptide.

In addition, the present invention envisions preparing peptides that have more well
defined structural properties, and the use of peptidomimetics, and peptidomimetic bonds,
such as ester bonds, to prepare peptides with novel properties. In another embodiment,
a peptide may be generated that incorporates a reduced peptide bond, i.e.,
R1-CH2-NH-R2, where R1 and R2 are amino acid residues or sequences. A reduced peptide
bond may be introduced as a dipeptide subunit. Such a molecule would be resistant to
peptide bond hydrolysis, e.g., protease activity. Such peptides would provide ligands with
unique function and activity, such as extended half-lives in vivo due to resistance to
metabolic breakdown, or protease activity. Furthermore, it is well known that in certain
systems constrained peptides show enhanced functional activity (Hruby, 1982, Life Sciences
31:189-199; Hruby et al., 1990, Biochem J. 268:249-262); the present invention provides
a method to produce a constrained peptide that incorporates random sequences at all
other positions.

A constrained, cyclic or rigidized peptide may be prepared synthetically, provided that
in at least two positions in the sequence of the peptide an amino acid or amino acid
analog is inserted that provides a chemical functional group capable of cross-linking to
constrain, cyclise or rigidize the peptide after treatment to form the cross-link.
Cyclization will be favored when a turn-inducing amino acid is incorporated. Examples
of amino acids capable of cross-linking a peptide are cysteine to form disulfide, aspartic
acid to form a lactone or a lactam, and a chelator such as γ-carboxyl-glutamic acid (Gla)
(Bachem) to chelate a transition metal and form a cross-link. Protected γ-carboxyl
glutamic acid may be prepared by modifying the synthesis described by Zee-Cheng and
Olson (1980, Biophys. Biochem. Res. Commun. 94:1128-1132). A peptide in which the
peptide sequence comprises at least two amino acids capable of cross-linking may be
treated, e.g., by oxidation of cysteine residues to form a disulfide or addition of a metal
ion to form a chelate, so as to cross-link the peptide and form a constrained, cyclic or
rigidized peptide.

The present invention provides strategies to systematically prepare cross-links. For
example, if four cysteine residues are incorporated in the peptide sequence, different
protecting groups may be used (Hiskey, 1981, in The Peptides: Analysis, Synthesis,
Biology, Vol. 3, Gross and Meienhofer, eds., Academic Press: New York, pp. 137-167;
Ponsanti et al., 1990, Tetrahedron 46:8255-8266). The first pair of cysteines may be
deprotected and oxidized, then the second set may be deprotected and oxidized. In this
way a defined set of disulfide cross-links may be formed. Alternatively, a pair of
cysteines and a pair of chelating amino acid analogs may be incorporated so that the
cross-links are of a different chemical nature.

The following non-classical amino acids may be incorporated in the peptide in order to
introduce particular conformational motifs: 1,2,3,4-tetrahydroisoquinoline-3-carboxylate
(Kazmierski et al., 1991, J. Am. Chem. Soc. 113:2275-2283); (2S,3S)-methyl-phenylalanine,
(2S,3R)-methyl-phenylalanine, (2R,3S)-methyl-phenylalanine and
(2R,3R)-methyl-phenylalanine (Kazmierski and Hruby, 1991, Tetrahedron Lett.);
2-aminotetrahydronaphthalene-2-carboxylic acid (Landis, 1989, Ph.D. Thesis, University
of Arizona); hydroxy-1,2,3,4-tetrahydroisoquinoline-3-carboxylate (Miyake et al., 1989,
J. Takeda Res. Labs. 43:53-76); β-carboline (D and L) (Kazmierski, 1988, Ph.D. Thesis,
University of Arizona); HIC (histidine isoquinoline carboxylic acid) (Zechel et al., 1991,
Int. J. Pep. Protein Res. 43); and HIC (histidine cyclic urea) (Dharanipragada).

The following amino acid analogs and peptidomimetics may be incorporated into a
peptide to induce or favor specific secondary structures: LL-Acp
(LL-3-amino-2-propenidone-6-carboxylic acid), a β-turn inducing dipeptide analog (Kemp et
al., 1985, J. Org. Chem. 50:5834-5838); β-sheet inducing analogs (Kemp et al., 1988,
Tetrahedron Lett. 29:5081-5082); β-turn inducing analogs (Kemp et al., 1988, Tetrahedron
Lett. 29:5057-5060); α-helix inducing analogs (Kemp et al., 1988, Tetrahedron Lett.
29:4935-4938); γ-turn inducing analogs (Kemp et al., 1989, J. Org. Chem. 54:109:115); and
analogs provided by the following references: Nagai and Sato, 1985, Tetrahedron Lett.
26:647-650; DiMaio et al., 1989, J. Chem. Soc. Perkin Trans. p. 1687; also a Gly-Ala
turn analog (Kahn et al., 1989, Tetrahedron Lett. 30:2317); amide bond isostere (Jones
et al., 1988, Tetrahedron Lett. 29:3853-3856); tetrazole (Zabrocki et al., 1988, J. Am.
Chem. Soc. 110:5875-5880); DTC (Samanen et al., 1990, Int. J. Protein Pep. Res.
35:501:509); and analogs taught in Olson et al., 1990, J. Am. Chem. Soc. 112:323-333
and Garvey et al., 1990, J. Org. Chem. 56:436. Conformationally restricted mimetics of
beta turns and beta bulges, and peptides containing them, are described in U.S. Patent No.
5,440,013, issued August 8, 1995 to Kahn.

The present invention further provides for modification or derivatization of the
polypeptide or peptide of the invention. Modifications of peptides are well known to one
of ordinary skill, and include phosphorylation, carboxymethylation, and acylation.
Modifications may be effected by chemical or enzymatic means. In another aspect,
glycosylated or fatty acylated peptide derivatives may be prepared. Preparation of
glycosylated or fatty acylated peptides is well known in the art. Fatty acyl peptide
derivatives may also be prepared. For example, and not by way of limitation, a free
amino group (N-terminal or lysyl) may be acylated, e.g., myristoylated. In another
embodiment an amino acid comprising an aliphatic side chain of the structure
-(CH2)nCH3 may be incorporated in the peptide. This and other peptide-fatty acid
conjugates suitable for use in the present invention are disclosed in U.K. Patent
GB-8809162.4, International Patent Application PCT/AU89/00166, and reference 5, supra.

Mutations can be made in a nucleic acid encoding the polypeptide such that a particular
codon is changed to a codon which codes for a different amino acid. Such a mutation is
generally made by making the fewest nucleotide changes possible. A substitution
mutation of this sort can be made to change an amino acid in the resulting protein in a
non-conservative manner (i.e., by changing the codon from an amino acid belonging to
a grouping of amino acids having a particular size or characteristic to an amino acid
belonging to another grouping) or in a conservative manner (i.e., by changing the codon
from an amino acid belonging to a grouping of amino acids having a particular size or
characteristic to an amino acid belonging to the same grouping). Such a conservative
change generally leads to less change in the structure and function of the resulting
protein. A non-conservative change is more likely to alter the structure, activity or
function of the resulting protein. The present invention should be considered to include
sequences containing conservative changes which do not significantly alter the activity
or binding characteristics of the resulting protein. Substitutes for an amino acid within
the sequence may be selected from other members of the class to which the amino acid
belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, leucine,
isoleucine, valine, proline, phenylalanine, tryptophan and methionine. Amino acids
containing aromatic ring structures are phenylalanine, tryptophan, and tyrosine. The
polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine,
asparagine, and glutamine. The positively charged (basic) amino acids include arginine,
lysine and histidine. The negatively charged (acidic) amino acids include aspartic acid
and glutamic acid. Such alterations will not be expected to affect apparent molecular
weight as determined by polyacrylamide gel electrophoresis, or isoelectric point.

Particularly preferred substitutions are:

- Lys for Arg and vice versa such that a positive charge may be maintained;
- Glu for Asp and vice versa such that a negative charge may be maintained;
- Ser for Thr such that a free -OH can be maintained; and
- Gln for Asn such that a free NH2 can be maintained.
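
The groupings described above can be expressed as a small lookup to classify a proposed substitution. This is a minimal sketch, not a definitive rule: it assumes that a substitution counts as conservative when the two residues share at least one of the classes named in the text (so that phenylalanine and tyrosine, which share only the aromatic class, are treated as conservative). The names `CLASSES`, `shared_classes` and `is_conservative` are illustrative.

```python
# Amino acid classes as listed in the text, using one-letter codes.
CLASSES = {
    "nonpolar": set("ALIVPFWM"),
    "aromatic": set("FWY"),
    "polar_neutral": set("GSTCYNQ"),
    "basic": set("RKH"),
    "acidic": set("DE"),
}


def shared_classes(a: str, b: str):
    """Return the sorted names of the classes that contain both residues."""
    a, b = a.upper(), b.upper()
    return sorted(
        name for name, members in CLASSES.items() if a in members and b in members
    )


def is_conservative(a: str, b: str) -> bool:
    """Assumption: conservative means the residues share at least one class."""
    return bool(shared_classes(a, b))


# The particularly preferred substitutions all fall within one class.
for pair in [("K", "R"), ("E", "D"), ("S", "T"), ("Q", "N")]:
    print(pair, is_conservative(*pair))  # True for each pair

print(is_conservative("K", "E"))  # False: basic versus acidic
```

Whether a given conservative change actually preserves lectin activity still has to be determined experimentally; the lookup only reflects the class membership stated in the text.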




Synthetic DNA sequences allow convenient construction of genes which will express
analogs or muteins. A general method for site-specific incorporation of unnatural
amino acids into proteins is described in Noren et al., Science, 244:182-188 (April 1989).
This method may be used to create analogs with unnatural amino acids.


In accordance with the present invention there may be employed conventional molecular
biology, microbiology, and recombinant DNA techniques within the skill of the art. Such
techniques are explained fully in the literature. See, e.g., Sambrook et al., "Molecular
Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular Biology"
Volumes I-III [Ausubel, R. M., ed. (1994)]; "Cell Biology: A Laboratory Handbook"
Volumes I-III [J. E. Celis, ed. (1994)]; "Current Protocols in Immunology" Volumes
I-III [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M.J. Gait ed. 1984);
"Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; "Transcription
And Translation" [B.D. Hames & S.J. Higgins, eds. (1984)]; "Animal Cell Culture" [R.I.
Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal,
"A Practical Guide To Molecular Cloning" (1984).

Chemical Moieties For Derivatization. Chemical moieties suitable for derivatization
may be selected from among water soluble polymers. The polymer selected should be
water soluble so that the component to which it is attached does not precipitate in an
aqueous environment, such as a physiological environment. Preferably, for therapeutic
use of the end-product preparation, the polymer will be pharmaceutically acceptable.
One skilled in the art will be able to select the desired polymer based on such
considerations as whether the polymer/component conjugate will be used therapeutically,
and if so, the desired dosage, circulation time, resistance to proteolysis, and other
considerations. For the present component or components, these may be ascertained
using the assays provided herein.

The water soluble polymer may be selected from the group consisting of, for example,
polyethylene glycol, copolymers of ethylene glycol/propylene glycol,
carboxymethylcellulose, dextran, polyvinyl alcohol, polyvinyl pyrrolidone,
poly-1,3-dioxolane, poly-1,3,6-trioxane, ethylene/maleic anhydride copolymer,
polyaminoacids (either homopolymers or random copolymers), and dextran or poly(n-vinyl
pyrrolidone)polyethylene glycol, polypropylene glycol homopolymers, polypropylene
oxide/ethylene oxide copolymers, polyoxyethylated polyols and polyvinyl alcohol.
Polyethylene glycol propionaldehyde may have advantages in manufacturing due to its
stability in water.

The polymer may be of any molecular weight, and may be branched or unbranched. For
polyethylene glycol, the preferred molecular weight is between about 2 kDa and about
100 kDa (the term "about" indicating that in preparations of polyethylene glycol, some
molecules will weigh more, some less, than the stated molecular weight) for ease in
handling and manufacturing. Other sizes may be used, depending on the desired
therapeutic profile (e.g., the duration of sustained release desired, the effects, if any, on
biological activity, the ease in handling, the degree or lack of antigenicity and other
known effects of the polyethylene glycol on a therapeutic protein or analog).

The number of polymer molecules so attached may vary, and one skilled in the art will
be able to ascertain the effect on function. One may mono-derivatize, or may provide for
a di-, tri-, tetra- or some combination of derivatization, with the same or different
chemical moieties (e.g., polymers, such as different weights of polyethylene glycols).
The proportion of polymer molecules to component or components molecules will vary,
as will their concentrations in the reaction mixture. In general, the optimum ratio (in
terms of efficiency of reaction in that there is no excess unreacted component or
components and polymer) will be determined by factors such as the desired degree of
derivatization (e.g., mono-, di-, tri-, etc.), the molecular weight of the polymer selected,
whether the polymer is branched or unbranched, and the reaction conditions.

The polyethylene glycol molecules (or other chemical moieties) should be attached to the
component or components with consideration of effects on functional or antigenic
domains of the protein. There are a number of attachment methods available to those
skilled in the art, e.g., EP 0 401 384, herein incorporated by reference (coupling PEG to
G-CSF); see also Malik et al., 1992, Exp. Hematol. 20:1028-1035 (reporting pegylation
of GM-CSF using tresyl chloride). For example, polyethylene glycol may be covalently
bound through amino acid residues via a reactive group, such as a free amino or carboxyl
group. Reactive groups are those to which an activated polyethylene glycol molecule
may be bound. The amino acid residues having a free amino group include lysine
residues and the N-terminal amino acid residue; those having a free carboxyl group
include aspartic acid residues, glutamic acid residues and the C-terminal amino acid
residue. Sulfhydryl groups may also be used as a reactive group for attaching the
polyethylene glycol molecule(s). Preferred for therapeutic purposes is attachment at an
amino group, such as attachment at the N-terminus or lysine group.
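
As a rough planning aid, the candidate attachment groups named above can be counted directly from a sequence. The sketch below is an illustration under simplifying assumptions: every lysine, aspartic acid, glutamic acid and cysteine side chain is treated as free and accessible, and each chain contributes one N-terminal amino group and one C-terminal carboxyl group. The function name `candidate_peg_sites` is hypothetical.

```python
def candidate_peg_sites(protein: str) -> dict:
    """Count candidate reactive groups for polymer attachment in one chain.

    Assumptions (not from the source): all side chains are free and
    accessible, and cysteines are not tied up in disulfide bonds.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return {
        # lysine side chains plus the N-terminal amino group
        "amino": seq.count("K") + 1,
        # aspartic and glutamic acid side chains plus the C-terminal carboxyl
        "carboxyl": seq.count("D") + seq.count("E") + 1,
        # cysteine side chains
        "sulfhydryl": seq.count("C"),
    }


# First 14 residues of SEQ ID NO 11, used only as a short example.
print(candidate_peg_sites("PGEKVAEAKKKVEE"))
# {'amino': 5, 'carboxyl': 5, 'sulfhydryl': 0}
```

The counts are upper bounds on sites of each chemistry; the actual degree of derivatization depends on the reaction conditions discussed above.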

This invention provides an isolated nucleic acid encoding a polypeptide comprising an
amino acid sequence of an N-terminal choline binding protein A truncate. This invention
provides an isolated nucleic acid encoding a polypeptide comprising an amino acid
sequence of an N-terminal choline binding protein A truncate as set forth in Figure 2. In
one embodiment the nucleic acid is set forth in any of SEQ ID NOS 12, 14-17, or 19-21,
including fragments, mutants, variants, analogs, or derivatives thereof. The nucleic
acid is DNA, cDNA, genomic DNA, or RNA. Further, the isolated nucleic acid may be
operatively linked to a promoter of RNA transcription. It is contemplated that the nucleic
acid is used to competitively inhibit the lectin activity.

A "vector" is a replicon, such as a plasmid, phage or cosmid, to which another DNA
segment may be attached so as to bring about the replication of the attached segment.

"DNA" refers to the polymeric form of deoxyribonucleotides (adenine, guanine,
thymine, or cytosine) in either its single-stranded form or a double-stranded helix. This
term refers only to the primary and secondary structure of the molecule, and does not
limit it to any particular tertiary forms. Thus, this term includes double-stranded DNA
found, inter alia, in linear DNA molecules (e.g., restriction fragments), viruses, plasmids,
and chromosomes. In discussing the structure of particular double-stranded DNA
molecules, sequences may be described herein according to the normal convention of
giving only the sequence in the 5' to 3' direction along the nontranscribed strand of DNA
(i.e., the strand having a sequence homologous to the mRNA).
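
The strand convention can be illustrated with a short example. Given the nontranscribed (coding) strand written 5' to 3', the transcribed (template) strand is its reverse complement and the mRNA has the same sequence with uracil in place of thymine. The helper names below are illustrative and not part of this specification.

```python
# Watson-Crick complement for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def template_strand(coding: str) -> str:
    """Return the transcribed strand, written 5' to 3' (reverse complement)."""
    return coding.upper().translate(COMPLEMENT)[::-1]


def mrna(coding: str) -> str:
    """Return the mRNA, which matches the coding strand with U replacing T."""
    return coding.upper().replace("T", "U")


coding = "ATGGCA"               # nontranscribed strand, 5' to 3'
print(template_strand(coding))  # TGCCAT
print(mrna(coding))             # AUGGCA
```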

A DNA sequence is "operatively linked" to an expression control sequence when the
expression control sequence controls and regulates the transcription and translation of
that DNA sequence. The term "operatively linked" includes having an appropriate start
signal (e.g., ATG) in front of the DNA sequence to be expressed and maintaining the
correct reading frame to permit expression of the DNA sequence under the control of the
expression control sequence and production of the desired product encoded by the DNA
sequence. If a gene that one desires to insert into a recombinant DNA molecule does not
contain an appropriate start signal, such a start signal can be inserted in front of the gene.
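
The start-signal requirement can be sketched as a simple check on an insert before cloning. This is an illustration only: the function name `ensure_start_codon` is hypothetical, ATG is taken as the start signal per the example in the text, and the insert is assumed to be supplied in frame as whole codons.

```python
def ensure_start_codon(orf: str, start: str = "ATG") -> str:
    """Return the coding sequence with a start codon in front, in frame.

    Raises ValueError if the length is not a multiple of three, since the
    reading frame cannot then be maintained through the whole insert.
    """
    seq = "".join(orf.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("length is not a multiple of 3; reading frame is ambiguous")
    if seq.startswith(start):
        return seq
    # Prepending exactly one codon keeps the downstream reading frame intact.
    return start + seq


# The first three codons of SEQ ID NO 19 lack a start codon.
print(ensure_start_codon("GAAAACGAA"))  # ATGGAAAACGAA
```

Prepending ATG adds an N-terminal methionine to the expressed product, consistent with the analogs having an N-terminal methionine described earlier in this section.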

Further, this invention also provides a vector which comprises the above-described
nucleic acid molecule. The promoter may be, or is identical to, a bacterial, yeast, insect
or mammalian promoter. Further, the vector may be a plasmid, cosmid, yeast artificial
chromosome (YAC), bacteriophage or eukaryotic viral DNA.

Numerous other vector backbones known in the art as useful for expressing protein may
be employed. Such vectors include, but are not limited to: adenovirus (AV),
adeno-associated virus (AAV), simian virus 40 (SV40), cytomegalovirus (CMV), mouse
mammary tumor virus (MMTV), Moloney murine leukemia virus, DNA delivery
systems, i.e. liposomes, and expression plasmid delivery systems. Further, one class of
vectors comprises DNA elements derived from viruses such as bovine papilloma virus,
polyoma virus, baculovirus, retroviruses or Semliki Forest virus. Such vectors may be
obtained commercially or assembled from the sequences described by methods
well-known in the art.

This invention also provides a host vector system for the production of a polypeptide
which comprises the vector in a suitable host cell. Suitable host cells include, but are not
limited to, prokaryotic or eukaryotic cells, e.g. bacterial cells (including gram positive
cells), yeast cells, fungal cells, insect cells, and animal cells. Numerous mammalian
cells may be used as hosts, including, but not limited to, the mouse fibroblast cell NIH
3T3, CHO cells, HeLa cells, Ltk cells, Cos cells, etc.

A wide variety of host/expression vector combinations may be employed in expressing 
the DNA sequences of this invention. Useful expression vectors, for example, may 
consist of segments of chromosomal, non-chromosomal and synthetic DNA sequences. 
Suitable vectors include derivatives of SV40 and known bacterial plasmids, e.g., E. coli 
plasmids ColE1, pCR1, pBR322, pMB9 and their derivatives, plasmids such as RP4; 
phage DNAs, e.g., the numerous derivatives of phage λ, e.g., NM989, and other phage 
DNA, e.g., M13 and filamentous single stranded phage DNA; yeast plasmids such as the 
2μ plasmid or derivatives thereof; vectors useful in eukaryotic cells, such as vectors 
useful in insect or mammalian cells; vectors derived from combinations of plasmids and 
phage DNAs, such as plasmids that have been modified to employ phage DNA or other 
expression control sequences; and the like. 

Any of a wide variety of expression control sequences (sequences that control the 
expression of a DNA sequence operatively linked to them) may be used in these vectors to 
express the DNA sequences of this invention. Such useful expression control sequences 
include, for example, the early or late promoters of SV40, CMV, vaccinia, polyoma or 
adenovirus, the lac system, the trp system, the TAC system, the TRC system, the LTR 
system, the major operator and promoter regions of phage λ, the control regions of fd 
coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the 
promoters of acid phosphatase (e.g., Pho5), the promoters of the yeast α-mating factors, 
and other sequences known to control the expression of genes of prokaryotic or 
eukaryotic cells or their viruses, and various combinations thereof. 

A wide variety of unicellular host cells are also useful in expressing the DNA sequences 
of this invention. These hosts may include well known eukaryotic and prokaryotic hosts, 
such as strains of E. coli, Pseudomonas, Bacillus, Streptomyces, fungi such as yeasts, 
and animal cells, such as CHO, R1.1, B-W and L-M cells, African Green Monkey kidney 
cells (e.g., COS 1, COS 7, BSC1, BSC40, and BMT10), insect cells (e.g., Sf9), and 
human cells and plant cells in tissue culture. 

It will be understood that not all vectors, expression control sequences and hosts will 
function equally well to express the DNA sequences of this invention. Neither will all 
hosts function equally well with the same expression system. However, one skilled in 
the art will be able to select the proper vectors, expression control sequences, and hosts 
without undue experimentation to accomplish the desired expression without departing 
from the scope of this invention. For example, in selecting a vector, the host must be 
considered because the vector must function in it. The vector's copy number, the ability 
to control that copy number, and the expression of any other proteins encoded by the 
vector, such as antibiotic markers, will also be considered. 

In selecting an expression control sequence, a variety of factors will normally be 
considered. These include, for example, the relative strength of the system, its 
controllability, and its compatibility with the particular DNA sequence or gene to be 
expressed, particularly as regards potential secondary structures. Suitable unicellular 
hosts will be selected by consideration of, e.g., their compatibility with the chosen vector, 
their secretion characteristics, their ability to fold proteins correctly, and their 
fermentation requirements, as well as the toxicity to the host of the product encoded by 
the DNA sequences to be expressed, and the ease of purification of the expression 
products. 

This invention further provides a method of producing a polypeptide which comprises 
growing the above-described host vector system under suitable conditions permitting the 
production of the polypeptide and recovering the polypeptide so produced. 

This invention further provides an antibody capable of specifically recognizing or 
binding to the isolated polypeptide. The antibody may be a monoclonal or polyclonal 
antibody. Further, the antibody may be labeled with a detectable marker that is a 
radioactive, colorimetric, fluorescent, or luminescent marker. The labeled antibody 
may be a polyclonal or monoclonal antibody. In one embodiment, the labeled antibody 
is a purified labeled antibody. Methods of labeling antibodies are well known in the art. 

The term "antibody" includes, by way of example, both naturally occurring and 
non-naturally occurring antibodies. Specifically, the term "antibody" includes polyclonal and 
monoclonal antibodies, and fragments thereof. Furthermore, the term "antibody" 
includes chimeric antibodies and wholly synthetic antibodies, and fragments thereof. Such 
antibodies include but are not limited to polyclonal, monoclonal, chimeric, single chain, 
Fab fragments, and an Fab expression library. 

Various procedures known in the art may be used for the production of polyclonal 
antibodies to the polypeptide or derivatives or analogs thereof (see, e.g., Antibodies: A 
Laboratory Manual, Harlow and Lane, eds., Cold Spring Harbor Laboratory Press: Cold 
Spring Harbor, New York, 1988). For the production of antibody, various host animals can 
be immunized by injection with the truncated CbpA, or a derivative (e.g., fragment or 
fusion protein) thereof, including but not limited to rabbits, mice, rats, sheep, goats, etc. 
In one embodiment, the polypeptide can be conjugated to an immunogenic carrier, e.g., 
bovine serum albumin (BSA) or keyhole limpet hemocyanin (KLH). Various adjuvants 
may be used to increase the immunological response, depending on the host species. 

For preparation of monoclonal antibodies, or a fragment, analog, or derivative thereof, any 
technique that provides for the production of antibody molecules by continuous cell lines 
in culture may be used (see, e.g., Antibodies: A Laboratory Manual, Harlow and Lane, 
eds., Cold Spring Harbor Laboratory Press: Cold Spring Harbor, New York, 1988). These 
include but are not limited to the hybridoma technique originally developed by Kohler 
and Milstein (1975, Nature 256:495-497), as well as the trioma technique, the human 
B-cell hybridoma technique (Kozbor et al., 1983, Immunology Today 4:72), and the 
EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). In an 
additional embodiment of the invention, monoclonal antibodies can be produced in 
germ-free animals utilizing recent technology (PCT/US90/02545). According to the invention, 
human antibodies may be used and can be obtained by using human hybridomas (Cote 
et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030) or by transforming human B 
cells with EBV in vitro (Cole et al., 1985, in Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, pp. 77-96). In fact, according to the invention, techniques 
developed for the production of "chimeric antibodies" (Morrison et al., 1984, J. Bacteriol. 
159:870; Neuberger et al., 1984, Nature 312:604-608; Takeda et al., 1985, Nature 
314:452-454) by splicing the genes from a mouse antibody molecule specific for a 
polypeptide together with genes from a human antibody molecule of appropriate 
biological activity can be used; such antibodies are within the scope of this invention. 

Such human or humanized chimeric antibodies are preferred for use in therapy of human 
diseases or disorders (described infra), since the human or humanized antibodies are 
much less likely than xenogenic antibodies to induce an immune response, in particular 
an allergic response, themselves. An additional embodiment of the invention utilizes the 
techniques described for the construction of Fab expression libraries (Huse et al., 1989, 
Science 246:1275-1281) to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity for the polypeptide, or its derivatives, or analogs. 

Antibody fragments which contain the idiotype of the antibody molecule can be 
generated by known techniques. For example, such fragments include but are not limited 
to: the F(ab')2 fragment which can be produced by pepsin digestion of the antibody 
molecule; the Fab' fragments which can be generated by reducing the disulfide bridges 
of the F(ab')2 fragment, and the Fab fragments which can be generated by treating the 
antibody molecule with papain and a reducing agent. 

In the production of antibodies, screening for the desired antibody can be accomplished 
by techniques known in the art, e.g., radioimmunoassay, ELISA (enzyme-linked 
immunosorbent assay), "sandwich" immunoassays, immunoradiometric assays, gel 
diffusion precipitin reactions, immunodiffusion assays, in situ immunoassays (using 
colloidal gold, enzyme or radioisotope labels, for example), western blots, precipitation 
reactions, agglutination assays (e.g., gel agglutination assays, hemagglutination assays), 
complement fixation assays, immunofluorescence assays, protein A assays, and 
immunoelectrophoresis assays, etc. In one embodiment, antibody binding is detected by 
detecting a label on the primary antibody. In another embodiment, the primary antibody 
is detected by detecting binding of a secondary antibody or reagent to the primary 
antibody. In a further embodiment, the secondary antibody is labeled. Many means are 
known in the art for detecting binding in an immunoassay and are within the scope of the 
present invention. 

Antibodies can be labeled for detection in vitro, e.g., with labels such as enzymes, 
fluorophores, chromophores, radioisotopes, dyes, colloidal gold, latex particles, and 
chemiluminescent agents. Alternatively, the antibodies can be labeled for detection in 
vivo, e.g., with radioisotopes (preferably technetium or iodine); magnetic resonance shift 
reagents (such as gadolinium and manganese); or radio-opaque reagents. 

The labels most commonly employed for these studies are radioactive elements, 
enzymes, chemicals which fluoresce when exposed to ultraviolet light, and others. A 
number of fluorescent materials are known and can be utilized as labels. These include, 
for example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue and Lucifer 
Yellow. A particular detecting material is anti-rabbit antibody prepared in goats and 
conjugated with fluorescein through an isothiocyanate. The polypeptide can also be 
labeled with a radioactive element or with an enzyme. The radioactive label can be 
detected by any of the currently available counting procedures. The preferred isotope 
may be selected from ³H, ¹⁴C, ³²P, ³⁵S, ³⁶Cl, ⁵¹Cr, ⁵⁷Co, ⁵⁸Co, ⁵⁹Fe, ⁹⁰Y, ¹²⁵I, ¹³¹I, and ¹⁸⁶Re. 

Enzyme labels are likewise useful, and can be detected by any of the presently utilized 
colorimetric, spectrophotometric, fluorospectrophotometric, amperometric or gasometric 
techniques. The enzyme is conjugated to the selected particle by reaction with bridging 
molecules such as carbodiimides, diisocyanates, glutaraldehyde and the like. Many 
enzymes which can be used in these procedures are known and can be utilized. The 
preferred are peroxidase, β-glucuronidase, β-D-glucosidase, β-D-galactosidase, urease, 
glucose oxidase plus peroxidase and alkaline phosphatase. U.S. Patent Nos. 3,654,090; 
3,850,752; and 4,016,043 are referred to by way of example for their disclosure of 
alternate labeling material and methods. 

In a further embodiment of this invention, commercial test kits suitable for use by a 
medical specialist may be prepared to determine the presence or absence of 
predetermined binding activity or predetermined binding activity capability to suspected 
target cells. In accordance with the testing techniques discussed above, one class of such 
kits will contain at least the labeled polypeptide or its binding partner, for instance an 
antibody specific thereto, and directions, of course, depending upon the method selected, 
e.g., "competitive," "sandwich," "DASP" and the like. The kits may also contain 
peripheral reagents such as buffers, stabilizers, etc. 

Accordingly, a test kit may be prepared for the demonstration of the presence or 
capability of cells for predetermined bacterial binding activity, comprising: 

(a) a predetermined amount of at least one labeled immunochemically reactive 
component obtained by the direct or indirect attachment of the present polypeptide 
or a specific binding partner thereto, to a detectable label; 

(b) other reagents; and 

(c) directions for use of said kit. 

This invention provides antagonists or blocking agents which include but are not limited 
to: peptide fragments, mimetics, a nucleic acid molecule, a ribozyme, a polypeptide, a 
small molecule, a carbohydrate molecule, a monosaccharide, an oligosaccharide or an 
antibody. Also, agents which competitively block or inhibit pneumococcal bacterium are 
contemplated by this invention. This invention provides an agent which comprises an 
inorganic compound, a nucleic acid molecule, an oligonucleotide, an organic compound, 
a peptide, a peptidomimetic compound, or a protein which inhibits the polypeptide. 

This invention provides a vaccine which comprises the polypeptide having the amino 
acid sequence as set forth in any of SEQ ID NOS: 1, 3-7, 9-11, 22, and 23 and a 
pharmaceutically acceptable adjuvant or carrier. The polypeptide may comprise an 
amino acid sequence of a N-terminal choline binding protein A truncate as set forth in 
Figure 2. This invention provides a vaccine which comprises the polypeptide having the 
amino acid sequence which comprises a conserved region as set forth in Figure 2 and a 
pharmaceutically acceptable adjuvant or carrier. For example, conserved regions include 
but are not limited to amino acid sequence 158 to 172; 300 to 321; 331 to 339; 355 to 
365; 367 to 374; 379 to 389; 409 to 427; and 430 to 447. This invention provides a 
vaccine comprising the isolated nucleic acid encoding the polypeptide and a 
pharmaceutically acceptable adjuvant or carrier. 
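As an illustration of how the conserved regions listed above could be pulled out of a sequence, the following Python sketch slices each range using 1-based, inclusive residue numbering. It assumes the numbers refer to positions in the sequence of Figure 2; the helper name `extract_regions` is hypothetical, and the placeholder string is NOT the CbpA sequence, only a stand-in of sufficient length.

```python
# Minimal sketch: slice the conserved regions named in the text out of a sequence.
# Assumption: the ranges are 1-based, inclusive residue positions.

CONSERVED_REGIONS = [
    (158, 172), (300, 321), (331, 339), (355, 365),
    (367, 374), (379, 389), (409, 427), (430, 447),
]


def extract_regions(sequence, regions=CONSERVED_REGIONS):
    """Return a dict mapping (start, end) to the corresponding subsequence."""
    segments = {}
    for start, end in regions:
        if start < 1 or end > len(sequence):
            raise ValueError(f"region {start}-{end} lies outside the sequence")
        # Convert 1-based inclusive coordinates to a Python slice.
        segments[(start, end)] = sequence[start - 1:end]
    return segments


# Placeholder sequence for demonstration only; substitute the real sequence.
placeholder = "".join("ACDEFGHIKLMNPQRSTVWY"[i % 20] for i in range(447))
for (start, end), segment in extract_regions(placeholder).items():
    print(start, end, len(segment))
```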

Active immunity against Gram positive bacteria, particularly pneumococcus, can be 
induced by immunization (vaccination) with an immunogenic amount of the polypeptide, 
or peptide derivative or fragment thereof, and an adjuvant, wherein the polypeptide, or 
antigenic derivative or fragment thereof, is the antigenic component of the vaccine. 

The polypeptides of the present invention, or derivatives or fragments thereof, can be 
prepared in an admixture with an adjuvant to prepare a vaccine. Preferably, the 
derivative or fragment thereof, used as the antigenic component of the vaccine is an 
adhesin. More preferably, the polypeptide or peptide derivative or fragment thereof, 
used as the antigenic component of the vaccine is an antigen common to all or many 
strains of a species of Gram positive bacteria, or common to closely related species of 
bacteria. Most preferably, the antigenic component of the vaccine is an adhesin that is 
a common antigen. 

Vectors containing the nucleic acid-based vaccine of the invention can be introduced into 
the desired host by methods known in the art, e.g., transfection, electroporation, 
microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, 
lipofection (lysosome fusion), use of a gene gun, or a DNA vector transporter (see, e.g., 
Wu et al., 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 
263:14621-14624; Hartmut et al., Canadian Patent Application No. 2,012,311, filed 
March 15, 1990). 

The vaccine can be administered via any parenteral route, including but not limited to 
intramuscular, intraperitoneal, intravenous, and the like. Preferably, since the desired 
result of vaccination is to elicit an immune response to the antigen, and thereby to the 
pathogenic organism, administration directly, or by targeting or choice of a viral vector, 
indirectly, to lymphoid tissues, e.g., lymph nodes or spleen, is desirable. Since immune 
cells are continually replicating, they are ideal targets for retroviral vector-based nucleic 
acid vaccines, since retroviruses require replicating cells. 

Passive immunity can be conferred to an animal subject suspected of suffering an 
infection with a Gram positive bacterium, preferably streptococcal, and more preferably 
pneumococcal, by administering antiserum, polyclonal antibodies, or a neutralizing 
monoclonal antibody against a polypeptide of the invention to the patient. Although 
passive immunity does not confer long term protection, it can be a valuable tool for the 
treatment of a bacterial infection of a subject who has not been vaccinated. Passive 
immunity is particularly important for the treatment of antibiotic resistant strains of Gram 
positive bacteria, since no other therapy may be available. Preferably, the antibodies 
administered for passive immune therapy are autologous antibodies. For example, if the 
subject is a human, preferably the antibodies are of human origin or have been 
"humanized," in order to minimize the possibility of an immune response against the 
antibodies. The active or passive vaccines of the invention, or the administration of an 
adhesin, can be used to protect an animal subject from infection by a Gram positive 
bacterium, preferably streptococcus, and more preferably, pneumococcus. 

This invention provides a pharmaceutical composition comprising an amount of the 
polypeptide as described and a pharmaceutically acceptable carrier or diluent. 


For example, such a pharmaceutical composition for preventing pneumococcal attachment 
to mucosal surfaces may include antibody to the lectin domain and/or soluble excess lectin 
domain proteins. Blocking adherence by either mechanism blocks the initial step in 
infection thereby reducing colonization. This in turn decreases person to person 
transmission and prevents development of symptomatic disease. 

This invention provides a method of inducing an immune response in a subject which has 
been exposed to or infected with a pneumococcal bacterium comprising administering 
to the subject an amount of the pharmaceutical composition, thereby inducing an immune 
response. 


This invention provides a method for preventing infection by a pneumococcal bacterium 
in a subject comprising administering to the subject an amount of the pharmaceutical 
composition effective to prevent pneumococcal bacterium attachment, thereby preventing 
infection by a pneumococcal bacterium. 

This invention provides a method for preventing infection by a pneumococcal bacterium 
in a subject comprising administering to the subject an amount of a pharmaceutical 
composition comprising the antibody and a pharmaceutically acceptable carrier or 
diluent, thereby preventing infection by a pneumococcal bacterium. 

This invention provides a method for treating a subject infected with or exposed to 
pneumococcal bacterium comprising administering to the subject a therapeutically 
effective amount of a vaccine of the invention, thereby treating the subject. 

This invention provides a method of inhibiting colonization of host cells in a subject 
which has been exposed to or infected with a pneumococcal bacterium comprising 
administering to the subject an amount of the pharmaceutical composition comprising 
the polypeptide consisting of the amino acid sequence as set forth in any of SEQ ID NOS: 
1, 3-5, 7, or 9-11, thereby inducing an immune response. The therapeutic peptide that 
blocks colonization is delivered via the respiratory mucosa. The pharmaceutical 
composition may comprise the polypeptide consisting of the amino acid sequence as set 
forth in Figure 2. 

As used herein, “pharmaceutical composition” could mean therapeutically effective 
amounts of polypeptide products of the invention together with suitable diluents, 
preservatives, solubilizers, emulsifiers, adjuvants and/or carriers useful to confer a 
therapeutic effect or benefit by, e.g., preventing pneumococcal colonization. A 
"therapeutically effective amount" as used herein refers to that amount which provides 
a therapeutic effect for a given condition and administration regimen. Such compositions 
are liquids or lyophilized or otherwise dried formulations and include diluents of various 
buffer content (e.g., Tris-HCl, acetate, phosphate), pH and ionic strength, additives such 
as albumin or gelatin to prevent adsorption to surfaces, detergents (e.g., Tween 20, 
Tween 80, Pluronic F68, bile acid salts), solubilizing agents (e.g., glycerol, polyethylene 
glycerol), anti-oxidants (e.g., ascorbic acid, sodium metabisulfite), preservatives (e.g., 
Thimerosal, benzyl alcohol, parabens), bulking substances or tonicity modifiers (e.g., 
lactose, mannitol), covalent attachment of polymers such as polyethylene glycol to the 
protein, complexation with metal ions, or incorporation of the material into or onto 
particulate preparations of polymeric compounds such as polylactic acid, polyglycolic 
acid, hydrogels, etc., or onto liposomes, microemulsions, micelles, unilamellar or 
multilamellar vesicles, erythrocyte ghosts, or spheroplasts. Such compositions will 
influence the physical state, solubility, stability, rate of in vivo release, and rate of in vivo 
clearance of the therapeutic agent. The choice of compositions will depend on the 
physical and chemical properties of the protein having therapeutic activity. For example, 
a product derived from a membrane-bound form of the active may require a formulation 
containing detergent. Controlled or sustained release compositions include formulation 
in lipophilic depots (e.g., fatty acids, waxes, oils). Also comprehended by the invention 
are particulate compositions coated with polymers (e.g., poloxamers or poloxamines) and 
actives coupled to antibodies directed against tissue-specific receptors, ligands or 
antigens or coupled to ligands of tissue-specific receptors. Other embodiments of the 
compositions of the invention incorporate particulate forms, protective coatings, protease 
inhibitors or permeation enhancers for various routes of administration, including 
parenteral, pulmonary, nasal and oral. 

Further, as used herein, “pharmaceutically acceptable carriers” are well known to those 
skilled in the art and include, but are not limited to, 0.01-0.1M and preferably 0.05M 
phosphate buffer or 0.8% saline. Additionally, such pharmaceutically acceptable carriers 
may be aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of 
non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as 
olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers include 
water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and 
buffered media. Parenteral vehicles include sodium chloride solution, Ringer’s dextrose, 
dextrose and sodium chloride, lactated Ringer’s or fixed oils. Intravenous vehicles 
include fluid and nutrient replenishers, electrolyte replenishers such as those based on 
Ringer’s dextrose, and the like. Preservatives and other additives may also be present, 
such as, for example, antimicrobials, antioxidants, chelating agents, inert gases and the 
like. 


The term "adjuvant" refers to a compound or mixture that enhances the immune response 
to an antigen. An adjuvant can serve as a tissue depot that slowly releases the antigen 
and also as a lymphoid system activator that non-specifically enhances the immune 
response (Hood et al., Immunology, Second Ed., 1984, Benjamin/Cummings: Menlo 
Park, California, p. 384). Often, a primary challenge with an antigen alone, in the 
absence of an adjuvant, will fail to elicit a humoral or cellular immune response. 
Adjuvants include, but are not limited to, complete Freund’s adjuvant, incomplete Freund’s 
adjuvant, saponin, mineral gels such as aluminum hydroxide, surface active substances 
such as lysolecithin, pluronic polyols, polyanions, peptides, oil or hydrocarbon 
emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful human 
adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. 
Preferably, the adjuvant is pharmaceutically acceptable. 

Controlled or sustained release compositions include formulation in lipophilic depots 
(e.g., fatty acids, waxes, oils). Also comprehended by the invention are particulate 
compositions coated with polymers (e.g., poloxamers or poloxamines) and the compound 
coupled to antibodies directed against tissue-specific receptors, ligands or antigens or 
coupled to ligands of tissue-specific receptors. Other embodiments of the compositions 
of the invention incorporate particulate forms, protective coatings, protease inhibitors or 
permeation enhancers for various routes of administration, including parenteral, 
pulmonary, nasal and oral. 

When administered, compounds are often cleared rapidly from mucosal surfaces or the 
circulation and may therefore elicit relatively short-lived pharmacological activity. 
Consequently, frequent administrations of relatively large doses of bioactive compounds 
may be required to sustain therapeutic efficacy. Compounds modified by the covalent 
attachment of water-soluble polymers such as polyethylene glycol, copolymers of 
polyethylene glycol and polypropylene glycol, carboxymethyl cellulose, dextran, 
polyvinyl alcohol, polyvinylpyrrolidone or polyproline are known to exhibit 
substantially longer half-lives in blood following intravenous injection than do the 
corresponding unmodified compounds (Abuchowski et al., 1981; Newmark et al., 1982; 
and Katre et al., 1987). Such modifications may also increase the compound's solubility 
in aqueous solution, eliminate aggregation, enhance the physical and chemical stability 
of the compound, and greatly reduce the immunogenicity and reactivity of the compound. 
As a result, the desired in vivo biological activity may be achieved by the administration 
of such polymer-compound adducts less frequently or in lower doses than with the 
unmodified compound. 

Dosages. The sufficient amount may include but is not limited to from about 1 pg/kg to 
about 1000 mg/kg. The amount may be 10 mg/kg. The pharmaceutically acceptable 
form of the composition includes a pharmaceutically acceptable carrier. 

As noted above, the present invention provides therapeutic compositions comprising 
pharmaceutical compositions comprising vectors, vaccines, polypeptides, nucleic acids 
and antibodies, anti-antibodies, and agents, to compete with the pneumococcus bacterium 
for pathogenic activities, such as adherence to host cells. 


The preparation of therapeutic compositions which contain an active component is well 
understood in the art. Typically, such compositions are prepared as an aerosol of the 
polypeptide delivered to the nasopharynx or as injectables, either as liquid solutions or 
suspensions; however, solid forms suitable for solution in, or suspension in, liquid prior 
to injection can also be prepared. The preparation can also be emulsified. The active 
therapeutic ingredient is often mixed with excipients which are pharmaceutically 
acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. 
In addition, if desired, the composition can contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents which enhance the 
effectiveness of the active ingredient. 

An active component can be formulated into the therapeutic composition as neutralized 
pharmaceutically acceptable salt forms. Pharmaceutically acceptable salts include the 
acid addition salts (formed with the free amino groups of the polypeptide or antibody 
molecule) and which are formed with inorganic acids such as, for example, hydrochloric 
or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the 
like. Salts formed from the free carboxyl groups can also be derived from inorganic 
bases such as, for example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino 
ethanol, histidine, procaine, and the like. 

A composition comprising "A" (where "A" is a single protein, DNA molecule, vector, 
etc.) is substantially free of "B" (where "B" comprises one or more contaminating 
proteins, DNA molecules, vectors, etc.) when at least about 75% by weight of the 
proteins, DNA, vectors (depending on the category of species to which A and B belong) 
in the composition is "A". Preferably, "A" comprises at least about 90% by weight of the 
A+B species in the composition, most preferably at least about 99% by weight. 
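The weight-fraction definition above can be expressed as a short Python sketch. The function names and tier labels are hypothetical, and the thresholds are treated as exact cut-offs even though the text says "at least about", so borderline values would need judgment rather than a strict comparison.

```python
# Minimal sketch of the "substantially free" definition by weight fraction.
# Names and labels are illustrative; thresholds (75%, 90%, 99%) come from the text.

def purity_fraction(weight_a: float, weight_b: float) -> float:
    """Fraction by weight of A among the A+B species (same units for both)."""
    total = weight_a + weight_b
    if total <= 0:
        raise ValueError("total weight of A and B must be positive")
    return weight_a / total


def purity_tier(weight_a: float, weight_b: float) -> str:
    """Classify a composition against the 75%, 90% and 99% thresholds."""
    fraction = purity_fraction(weight_a, weight_b)
    if fraction >= 0.99:
        return "most preferred"
    if fraction >= 0.90:
        return "preferred"
    if fraction >= 0.75:
        return "substantially free"
    return "not substantially free"


# Example: 80 mg of protein A with 20 mg of contaminating proteins B.
print(purity_tier(80.0, 20.0))
```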
The phrase "therapeutically effective amount" is used herein to mean an amount 
sufficient to reduce by at least about 15 percent, preferably by at least 50 percent, more 
preferably by at least 90 percent, and most preferably prevent, a clinically significant 
deficit in the activity, function and response of the host. Alternatively, a therapeutically 
effective amount is sufficient to cause an improvement in a clinically significant 
condition in the host. In the context of the present invention, a deficit in the response of 
the host is evidenced by continuing or spreading bacterial infection. An improvement 
in a clinically significant condition in the host includes a decrease in bacterial load, 
clearance of bacteria from colonized host cells, reduction in fever or inflammation 
associated with infection, or a reduction in any symptom associated with the bacterial 
infection. 

According to the invention, the component or components of a therapeutic composition 
of the invention may be introduced parenterally, transmucosally, e.g., orally, nasally, 
pulmonarily, or rectally, or transdermally. Preferably, administration is parenteral, e.g., 
via intravenous injection, and also includes, but is not limited to, intra-arterial, 
intramuscular, intradermal, subcutaneous, intraperitoneal, intraventricular, and 
intracranial administration. Oral or pulmonary delivery may be preferred to activate 
mucosal immunity; since pneumococci generally colonize the nasopharyngeal and 
pulmonary mucosa, mucosal immunity may be a particularly effective preventive 
treatment. The term "unit dose" when used in reference to a therapeutic composition of 
the present invention refers to physically discrete units suitable as unitary dosage for 
humans, each unit containing a predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with the required diluent, i.e., carrier 
or vehicle. 


In another embodiment, the active compound can be delivered in a vesicle, in particular 
a liposome (see Langer, Science 249:1527-1533 (1990); Treat et al., in Liposomes in the 
Therapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New 
York, pp. 353-365 (1989); Lopez-Berestein, ibid., pp. 317-327; see generally ibid.). 

In yet another embodiment, the therapeutic compound can be delivered in a controlled 
release system. For example, the polypeptide may be administered using intravenous
infusion, an implantable osmotic pump, a transdermal patch, liposomes, or other modes
of administration. In one embodiment, a pump may be used (see Langer, supra; Sefton,
CRC Crit. Ref. Biomed. Eng. 14:201 (1987); Buchwald et al., Surgery 88:507 (1980);
Saudek et al., N. Engl. J. Med. 321:574 (1989)). In another embodiment, polymeric
materials can be used (see Medical Applications of Controlled Release, Langer and Wise
(eds.), CRC Press, Boca Raton, Florida (1974); Controlled Drug Bioavailability, Drug
Product Design and Performance, Smolen and Ball (eds.), Wiley, New York (1984);
Langer and Peppas, J. Macromol. Sci. Rev. Macromol. Chem. 23:61 (1983); see also
Levy et al., Science 228:190 (1985); During et al., Ann. Neurol. 25:351 (1989); Howard
et al., J. Neurosurg. 71:105 (1989)). In yet another embodiment, a controlled release
system can be placed in proximity of the therapeutic target, i.e., the brain, thus requiring
only a fraction of the systemic dose (see, e.g., Goodson, in Medical Applications of
Controlled Release, supra, vol. 2, pp. 115-138 (1984)). Preferably, a controlled release
device is introduced into a subject in proximity of the site of inappropriate immune
activation or a tumor. Other controlled release systems are discussed in the review by
Langer (Science 249:1527-1533 (1990)).

A subject in whom administration of an active component as set forth above is an 
effective therapeutic regimen for a bacterial infection is preferably a human, but can be 
any animal. Thus, as can be readily appreciated by one of ordinary skill in the art, the
methods and pharmaceutical compositions of the present invention are particularly suited
to administration to any animal, particularly a mammal, and including, but by no means
limited to, domestic animals, such as feline or canine subjects, farm animals, such as but
not limited to bovine, equine, caprine, ovine, and porcine subjects, wild animals (whether
in the wild or in a zoological garden), research animals, such as mice, rats, rabbits, goats,
sheep, pigs, dogs, cats, etc., i.e., for veterinary medical use. 

In the therapeutic methods and compositions of the invention, a therapeutically effective 
dosage of the active component is provided. A therapeutically effective dosage can be 
determined by the ordinary skilled medical worker based on patient characteristics (age,
weight, sex, condition, complications, other diseases, etc.), as is well known in the art.
Furthermore, as further routine studies are conducted, more specific information will
emerge regarding appropriate dosage levels for treatment of various conditions in various
patients, and the ordinary skilled worker, considering the therapeutic context, age and
general health of the recipient, is able to ascertain proper dosing. Generally, for
intravenous injection or infusion, dosage may be lower than for intraperitoneal,
intramuscular, or other route of administration. The dosing schedule may vary,
depending on the circulation half-life, and the formulation used. The compositions are
administered in a manner compatible with the dosage formulation in the therapeutically
effective amount. Precise amounts of active ingredient required to be administered
depend on the judgment of the practitioner and are peculiar to each individual. However,
suitable dosages may range from about 0.1 to 20, preferably about 0.5 to about 10, and
more preferably one to several, milligrams of active ingredient per kilogram body weight
of individual per day and depend on the route of administration. Suitable regimes for
initial administration and booster shots are also variable, but are typified by an initial
administration followed by repeated doses at one or more hour intervals by a subsequent
injection or other administration. Alternatively, continuous intravenous infusion
sufficient to maintain concentrations of ten nanomolar to ten micromolar in the blood is
contemplated.
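
The per-kilogram ranges stated above convert to a daily amount by multiplying by body weight. The following is a minimal sketch of that arithmetic only; the function name and the 70 kg body weight are hypothetical choices for illustration and do not appear in the specification.

```python
def daily_dose_range_mg(weight_kg, low_mg_per_kg=0.1, high_mg_per_kg=20.0):
    """Return (low, high) daily amounts in mg for a given body weight.

    The defaults are the broad range quoted in the text
    (about 0.1 to 20 mg per kg body weight per day).
    """
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    return weight_kg * low_mg_per_kg, weight_kg * high_mg_per_kg


# Hypothetical 70 kg subject, used only to make the arithmetic concrete.
broad = daily_dose_range_mg(70.0)                # broad range: 0.1 to 20 mg/kg/day
preferred = daily_dose_range_mg(70.0, 0.5, 10.0) # preferred range: 0.5 to 10 mg/kg/day

print(f"broad range:     {broad[0]:.1f} to {broad[1]:.1f} mg/day")
print(f"preferred range: {preferred[0]:.1f} to {preferred[1]:.1f} mg/day")
```

The sketch deliberately leaves out route of administration and dosing schedule, which the text says also affect the amount.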

Administration with other compounds. For treatment of a bacterial infection, one may 
administer the present active component in conjunction with one or more pharmaceutical 
compositions used for treating bacterial infection, including but not limited to (1)
antibiotics; (2) soluble carbohydrate inhibitors of bacterial adhesin; (3) other small
molecule inhibitors of bacterial adhesin; (4) inhibitors of bacterial metabolism,
transport, or transformation; (5) stimulators of bacterial lysis; or (6) anti-bacterial
antibodies or vaccines directed at other bacterial antigens. Other potential active
components include anti-inflammatory agents, such as steroids and non-steroidal
anti-inflammatory drugs. Administration may be simultaneous (for example, administration
of a mixture of the present active component and an antibiotic), or may be seriatim.

Accordingly, in a specific embodiment, the therapeutic compositions may further include
an effective amount of the active component, and one or more of the following active
ingredients: an antibiotic, a steroid, etc. Exemplary formulations are given below:

Formulations

Intravenous Formulation I
Ingredient                          mg/ml
cefotaxime                          250.0
Polypeptide                          10.0
dextrose USP                         45.0
sodium bisulfite USP                  3.2
edetate disodium USP                  0.1
water for injection q.s.a.d.          1.0 ml

Intravenous Formulation II
Ingredient                          mg/ml
ampicillin                          250.0
Polypeptide                          10.0
sodium bisulfite USP                  3.2
disodium edetate USP                  0.1
water for injection q.s.a.d.          1.0 ml

Intravenous Formulation III
Ingredient                          mg/ml
gentamicin (charged as sulfate)      40.0
Polypeptide                          10.0
sodium bisulfite USP                  3.2
disodium edetate USP                  0.1
water for injection q.s.a.d.          1.0 ml

Intravenous Formulation IV
Ingredient                          mg/ml
Polypeptide                          10.0
dextrose USP                         45.0
sodium bisulfite USP                  3.2
edetate disodium USP                  0.1
water for injection q.s.a.d.          1.0 ml

Intravenous Formulation V
Ingredient                          mg/ml
Polypeptide antagonist                5.0
sodium bisulfite USP                  3.2
disodium edetate USP                  0.1
water for injection q.s.a.d.          1.0 ml

Thus, in a specific instance where it is desired to reduce or inhibit the infection resulting
from a bacterium-mediated binding of bacteria to a host cell, or an antibody thereto, or
a ligand thereof or an antibody to that ligand, the polypeptide is introduced to block the
interaction of the bacteria with the host cell.

Also contemplated herein is pulmonary delivery of the present polypeptide having lectin
activity which acts as an adhesin inhibitory agent (or derivatives thereof), of the
invention. The adhesin inhibitory agent (or derivative) is delivered to the lungs of a
mammal, where it can interfere with bacterial, i.e., streptococcal, and preferably
pneumococcal binding to host cells. Other reports of preparation of proteins for
pulmonary delivery are found in the art [Adjei et al., Pharmaceutical Research, 7:565-569
(1990); Adjei et al., International Journal of Pharmaceutics, 63:135-144 (1990)
(leuprolide acetate); Braquet et al., Journal of Cardiovascular Pharmacology,
13(suppl. 5):143-146 (1989) (endothelin-1); Hubbard et al., Annals of Internal Medicine,
Vol. III, pp. 206-212 (1989) (α1-antitrypsin); Smith et al., J. Clin. Invest. 84:1145-1146
(1989) (α-1-proteinase); Oswein et al., "Aerosolization of Proteins", Proceedings of
Symposium on Respiratory Drug Delivery II, Keystone, Colorado, March, (1990)
(recombinant human growth hormone); Debs et al., J. Immunol. 140:3482-3488 (1988)
(interferon-γ and tumor necrosis factor alpha); Platz et al., U.S. Patent No. 5,284,656
(granulocyte colony stimulating factor)]. A method and composition for pulmonary
delivery of drugs is described in U.S. Patent No. 5,451,569, issued September 19, 1995
to Wong et al.

All such devices require the use of formulations suitable for the dispensing of adhesin
inhibitory agent (or derivative). Typically, each formulation is specific to the type of
device employed and may involve the use of an appropriate propellant material, in
addition to the usual diluents, adjuvants and/or carriers useful in therapy. Also, the use
of liposomes, microcapsules or microspheres, inclusion complexes, or other types of
carriers is contemplated. Chemically modified adhesin inhibitory agent may also be
prepared in different formulations depending on the type of chemical modification or the
type of device employed.


Formulations suitable for use with a nebulizer, either jet or ultrasonic, will typically
comprise adhesin inhibitory agent (or derivative) dissolved in water at a concentration
of about 0.1 to 25 mg of biologically active adhesin inhibitory agent per ml of solution.
The formulation may also include a buffer and a simple sugar (e.g., for adhesin inhibitory
agent stabilization and regulation of osmotic pressure). The nebulizer formulation may
also contain a surfactant, to reduce or prevent surface induced aggregation of the adhesin
inhibitory agent caused by atomization of the solution in forming the aerosol.




Formulations for use with a metered-dose inhaler device will generally comprise a finely
divided powder containing the adhesin inhibitory agent (or derivative) suspended in a
propellant with the aid of a surfactant. The propellant may be any conventional material
employed for this purpose, such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a
hydrofluorocarbon, or a hydrocarbon, including trichlorofluoromethane,
dichlorodifluoromethane, dichlorotetrafluoroethanol, and 1,1,1,2-tetrafluoroethane, or
combinations thereof. Suitable surfactants include sorbitan trioleate and soya lecithin.
Oleic acid may also be useful as a surfactant.


The liquid aerosol formulations contain adhesin inhibitory agent and a dispersing agent
in a physiologically acceptable diluent. The dry powder aerosol formulations of the
present invention consist of a finely divided solid form of adhesin inhibitory agent and
a dispersing agent. With either the liquid or dry powder aerosol formulation, the
formulation must be aerosolized. That is, it must be broken down into liquid or solid
particles in order to ensure that the aerosolized dose actually reaches the mucous
membranes of the nasal passages or the lung. The term "aerosol particle" is used herein
to describe the liquid or solid particle suitable for nasal or pulmonary administration, i.e.,
that will reach the mucous membranes. Other considerations, such as construction of the
delivery device, additional components in the formulation, and particle characteristics are
important. These aspects of pulmonary administration of a drug are well known in the
art, and manipulation of formulations, aerosolization means and construction of a
delivery device require at most routine experimentation by one of ordinary skill in the art.

In a particular embodiment, the mass median dynamic diameter will be 5 micrometers 
or less in order to ensure that the drug particles reach the lung alveoli [Wearley, L.L.,
Crit. Rev. in Ther. Drug Carrier Systems 8:333 (1991)]. 
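
A mass median diameter is the particle size below which half of the total particle mass lies. The sketch below illustrates how such a value could be checked against the 5 micrometer limit stated above. It rests on assumptions not made in the specification: the particles are treated as equal-density spheres (so mass scales with the cube of diameter), the measured diameters are taken to stand in for the dynamic diameter, and the sample values and function names are hypothetical.

```python
def mass_median_diameter(diameters_um):
    """Smallest diameter at which cumulative particle mass reaches 50% of total.

    Assumes equal-density spheres, so each particle's mass is proportional
    to the cube of its diameter.
    """
    sizes = sorted(diameters_um)
    masses = [d ** 3 for d in sizes]
    half_total = sum(masses) / 2.0
    running = 0.0
    for diameter, mass in zip(sizes, masses):
        running += mass
        if running >= half_total:
            return diameter
    raise ValueError("at least one particle diameter is required")


def within_limit(diameters_um, limit_um=5.0):
    """True if the mass median diameter is at or below the stated limit."""
    return mass_median_diameter(diameters_um) <= limit_um


# Hypothetical particle diameters in micrometers, for illustration only.
sample = [1, 2, 2, 3, 3, 4, 5]
mmd = mass_median_diameter(sample)
print(mmd, within_limit(sample))
```

Because mass grows with the cube of diameter, a few large particles dominate the result, which is why a mass-weighted median is more informative here than a simple count median.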

Systems of aerosol delivery, such as the pressurized metered dose inhaler and the dry
powder inhaler, are disclosed in Newman, S.P., Aerosols and the Lung, Clarke, S.W. and
Pavia, D. editors, pp. 197-22 and can be used in connection with the present invention.

In a further embodiment, as discussed in detail infra, an aerosol formulation of the 
present invention can include other therapeutically or pharmacologically active 
ingredients in addition to adhesin inhibitory agent, such as but not limited to an 
antibiotic, a steroid, a non-steroidal anti-inflammatory drug, etc.

Liquid Aerosol Formulations. The present invention provides aerosol formulations and
dosage forms for use in treating subjects suffering from bacterial, e.g., streptococcal, in
particular pneumococcal, infection. In general such dosage forms contain adhesin
inhibitory agent in a pharmaceutically acceptable diluent. Pharmaceutically acceptable
diluents include but are not limited to sterile water, saline, buffered saline, dextrose
solution, and the like. In a specific embodiment, a diluent that may be used in the present
invention or the pharmaceutical formulation of the present invention is phosphate
buffered saline, or a buffered saline solution generally between the pH 7.0-8.0 range, or
water.

The liquid aerosol formulation of the present invention may include, as optional
ingredients, pharmaceutically acceptable carriers, diluents, solubilizing or emulsifying
agents, surfactants and excipients. The formulation may include a carrier. The carrier
is a macromolecule which is soluble in the circulatory system and which is
physiologically acceptable where physiological acceptance means that those of skill in
the art would accept injection of said carrier into a patient as part of a therapeutic regime.
The carrier preferably is relatively stable in the circulatory system with an acceptable
plasma half life for clearance. Such macromolecules include but are not limited to soya
lecithin, oleic acid and sorbitan trioleate, with sorbitan trioleate preferred.

The formulations of the present embodiment may also include other agents useful for pH 
maintenance, solution stabilization, or for the regulation of osmotic pressure. Examples
of the agents include but are not limited to salts, such as sodium chloride, or potassium
chloride, and carbohydrates, such as glucose, galactose or mannose, and the like. 

The present invention further contemplates liquid aerosol formulations comprising 
adhesin inhibitory agent and another therapeutically effective drug, such as an antibiotic,
a steroid, a non-steroidal anti-inflammatory drug, etc.

Aerosol Dry Powder Formulations. It is also contemplated that the present aerosol 
formulation can be prepared as a dry powder formulation comprising a finely divided 
powder form of adhesin inhibitory agent and a dispersant.

Formulations for dispensing from a powder inhaler device will comprise a finely divided
dry powder containing adhesin inhibitory agent (or derivative) and may also include a
bulking agent, such as lactose, sorbitol, sucrose, or mannitol in amounts which facilitate
dispersal of the powder from the device, e.g., 50 to 90% by weight of the formulation.
The adhesin inhibitory agent (or derivative) should most advantageously be prepared in
particulate form with an average particle size of less than 10 μm (or microns), most
preferably 0.5 to 5 μm, for most effective delivery to the distal lung. In another
embodiment, the dry powder formulation can comprise a finely divided dry powder
containing adhesin inhibitory agent, a dispersing agent and also a bulking agent. Bulking
agents useful in conjunction with the present formulation include such agents as lactose,
sorbitol, sucrose, or mannitol, in amounts that facilitate the dispersal of the powder from
the device.

The present invention further contemplates dry powder formulations comprising adhesin
inhibitory agent and another therapeutically effective drug, such as an antibiotic, a
steroid, a non-steroidal anti-inflammatory drug, etc.

Contemplated for use herein are oral solid dosage forms, which are described generally
in Remington's Pharmaceutical Sciences, 18th Ed. 1990 (Mack Publishing Co. Easton
PA 18042) at Chapter 89, which is herein incorporated by reference. Solid dosage forms
include tablets, capsules, pills, troches or lozenges, cachets or pellets. Also, liposomal
or proteinoid encapsulation may be used to formulate the present compositions (as, for
example, proteinoid microspheres reported in U.S. Patent No. 4,925,673). Liposomal
encapsulation may be used and the liposomes may be derivatized with various polymers
(e.g., U.S. Patent No. 5,013,556). A description of possible solid dosage forms for the
therapeutic is given by Marshall, K. In: Modern Pharmaceutics Edited by G.S. Banker
and C.T. Rhodes Chapter 10, 1979, herein incorporated by reference. In general, the
formulation will include the component or components (or chemically modified forms
thereof) and inert ingredients which allow for protection against the stomach
environment, and release of the biologically active material in the intestine.

Also specifically contemplated are oral dosage forms of the above derivatized component
or components. The component or components may be chemically modified so that oral
delivery of the derivative is efficacious. Generally, the chemical modification
contemplated is the attachment of at least one moiety to the component molecule itself,
where said moiety permits (a) inhibition of proteolysis; and (b) uptake into the blood
stream from the stomach or intestine. Also desired is the increase in overall stability of
the component or components and increase in circulation time in the body. Examples of
such moieties include: polyethylene glycol, copolymers of ethylene glycol and propylene
glycol, carboxymethyl cellulose, dextran, polyvinyl alcohol, polyvinyl pyrrolidone and
polyproline. Abuchowski and Davis, 1981, "Soluble Polymer-Enzyme Adducts" In:
Enzymes as Drugs, Holcenberg and Roberts, eds., Wiley-Interscience, New York, NY, pp.
367-383; Newmark, et al., 1982, J. Appl. Biochem. 4:185-189. Other polymers that
could be used are poly-1,3-dioxolane and poly-1,3,6-trioxocane. Preferred for
pharmaceutical usage, as indicated above, are polyethylene glycol moieties.

For the component (or derivative) the location of release may be the stomach, the small
intestine (the duodenum, the jejunum, or the ileum), or the large intestine. One skilled
in the art has available formulations which will not dissolve in the stomach, yet will
release the material in the duodenum or elsewhere in the intestine. Preferably, the release
will avoid the deleterious effects of the stomach environment, either by protection of the
protein (or derivative) or by release of the biologically active material beyond the
stomach environment, such as in the intestine.


To ensure full gastric resistance a coating impermeable to at least pH 5.0 is essential.
Examples of the more common inert ingredients that are used as enteric coatings are
cellulose acetate trimellitate (CAT), hydroxypropylmethylcellulose phthalate (HPMCP),
HPMCP 50, HPMCP 55, polyvinyl acetate phthalate (PVAP), Eudragit L30D, Aquateric,
cellulose acetate phthalate (CAP), Eudragit L, Eudragit S, and Shellac. These coatings
may be used as mixed films.

A coating or mixture of coatings can also be used on tablets, which are not intended for 
protection against the stomach. This can include sugar coatings, or coatings which make 
the tablet easier to swallow. Capsules may consist of a hard shell (such as gelatin) for 
delivery of dry therapeutic, i.e., powder; for liquid forms, a soft gelatin shell may be used.

The shell material of cachets could be thick starch or other edible paper. For pills, 
lozenges, molded tablets or tablet triturates, moist massing techniques can be used. 

The peptide therapeutic can be included in the formulation as fine multiparticulates in the 
form of granules or pellets of particle size about 1 mm. The formulation of the material
for capsule administration could also be as a powder, lightly compressed plugs or even 
as tablets. The therapeutic could be prepared by compression. 

Colorants and flavoring agents may all be included. For example, the protein (or 
derivative) may be formulated (such as by liposome or microsphere encapsulation) and
then further contained within an edible product, such as a refrigerated beverage 
containing colorants and flavoring agents. 

One may dilute or increase the volume of the therapeutic with an inert material. These
diluents could include carbohydrates, especially mannitol, α-lactose, anhydrous lactose,
cellulose, sucrose, modified dextran and starch. Certain inorganic salts may also be
used as fillers including calcium triphosphate, magnesium carbonate and sodium
chloride. Some commercially available diluents are Fast-Flo, Emdex, STA-Rx 1500,
Emcompress and Avicel.

Disintegrants may be included in the formulation of the therapeutic into a solid dosage
form. Materials used as disintegrants include but are not limited to starch, including the
commercial disintegrant based on starch, Explotab. Sodium starch glycolate, Amberlite,
sodium carboxymethylcellulose, ultramylopectin, sodium alginate, gelatin, orange peel,
acid carboxymethyl cellulose, natural sponge and bentonite may all be used. Another
form of the disintegrants are the insoluble cationic exchange resins. Powdered gums may
be used as disintegrants and as binders and these can include powdered gums such as
agar, Karaya or tragacanth. Alginic acid and its sodium salt are also useful as
disintegrants. Binders may be used to hold the therapeutic agent together to form a hard
tablet and include materials from natural products such as acacia, tragacanth, starch and
gelatin. Others include methyl cellulose (MC), ethyl cellulose (EC) and carboxymethyl
cellulose (CMC). Polyvinyl pyrrolidone (PVP) and hydroxypropylmethyl cellulose
(HPMC) could both be used in alcoholic solutions to granulate the therapeutic.

An antifrictional agent may be included in the formulation of the therapeutic to prevent
sticking during the formulation process. Lubricants may be used as a layer between the
therapeutic and the die wall, and these can include but are not limited to: stearic acid
including its magnesium and calcium salts, polytetrafluoroethylene (PTFE), liquid
paraffin, vegetable oils and waxes. Soluble lubricants may also be used such as sodium
lauryl sulfate, magnesium lauryl sulfate, polyethylene glycol of various molecular
weights, Carbowax 4000 and 6000.

Glidants that might improve the flow properties of the drug during formulation and to aid 
rearrangement during compression might be added. The glidants may include starch, 
talc, pyrogenic silica and hydrated silicoaluminate. 

To aid dissolution of the therapeutic into the aqueous environment a surfactant might be
added as a wetting agent. Surfactants may include anionic detergents such as sodium
lauryl sulfate, dioctyl sodium sulfosuccinate and dioctyl sodium sulfonate. Cationic
detergents might be used and could include benzalkonium chloride or benzethonium
chloride. The list of potential nonionic detergents that could be included in the
formulation as surfactants are lauromacrogol 400, polyoxyl 40 stearate, polyoxyethylene
hydrogenated castor oil 10, 50 and 60, glycerol monostearate, polysorbate 40, 60, 65 and
80, sucrose fatty acid ester, methyl cellulose and carboxymethyl cellulose. These
surfactants could be present in the formulation of the protein or derivative either alone
or as a mixture in different ratios.

Additives which potentially enhance uptake of the polypeptide (or derivative) are for 
instance the fatty acids oleic acid, linoleic acid and linolenic acid. 

Pulmonary Delivery. Also contemplated herein is pulmonary delivery of the present
polypeptide (or derivatives thereof). The polypeptide (or derivative) is delivered to the
lungs of a mammal while inhaling and coats the mucosal surface of the alveoli. Other
reports of this include Adjei et al., 1990, Pharmaceutical Research, 7:565-569; Adjei
et al., 1990, International Journal of Pharmaceutics, 63:135-144 (leuprolide acetate);
Braquet et al., 1989, Journal of Cardiovascular Pharmacology, 13(suppl. 5):143-146
(endothelin-1); Hubbard et al., 1989, Annals of Internal Medicine, Vol. III, pp. 206-212
(α1-antitrypsin); Smith et al., 1989, J. Clin. Invest. 84:1145-1146 (α-1-proteinase);
Oswein et al., 1990, "Aerosolization of Proteins", Proceedings of Symposium on
Respiratory Drug Delivery II, Keystone, Colorado, March, (recombinant human growth
hormone); Debs et al., 1988, J. Immunol. 140:3482-3488 (interferon-γ and tumor
necrosis factor alpha) and Platz et al., U.S. Patent No. 5,284,656 (granulocyte colony
stimulating factor). A method and composition for pulmonary delivery of drugs for
systemic effect is described in U.S. Patent No. 5,451,569, issued September 19, 1995 to
Wong et al.

Contemplated for use in the practice of this invention are a wide range of mechanical 
devices designed for pulmonary delivery of therapeutic products, including but not 
limited to nebulizers, metered dose inhalers, and powder inhalers, all of which are 
familiar to those skilled in the art.

Formulations suitable for use with a nebulizer, either jet or ultrasonic, will typically
comprise polypeptide (or derivative) dissolved in water at a concentration of about 0.1
to 25 mg of biologically active protein per mL of solution. The formulation may also
include a buffer and a simple sugar (e.g., for protein stabilization and regulation of
osmotic pressure). The nebulizer formulation may also contain a surfactant, to reduce
or prevent surface induced aggregation of the protein caused by atomization of the
solution in forming the aerosol.

Formulations for use with a metered-dose inhaler device will generally comprise a finely 
divided powder containing the polypeptide (or derivative) suspended in a propellant with 
the aid of a surfactant. The propellant may be any conventional material employed for 
this purpose, such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a
hydrofluorocarbon, or a hydrocarbon, including trichlorofluoromethane,
dichlorodifluoromethane, dichlorotetrafluoroethanol, and 1,1,1,2-tetrafluoroethane, or
combinations thereof. Suitable surfactants include sorbitan trioleate and soya lecithin. 
Oleic acid may also be useful as a surfactant. 

Formulations for dispensing from a powder inhaler device will comprise a finely divided 
dry powder containing polypeptide (or derivative) and may also include a bulking agent, 
such as lactose, sorbitol, sucrose, or mannitol in amounts which facilitate dispersal of the 
powder from the device, e.g., 50 to 90% by weight of the formulation. The protein (or 
derivative) should most advantageously be prepared in particulate form with an average
particle size of less than 10 μm (or microns), most preferably 0.5 to 5 μm, for most
effective delivery to the distal lung.

Nasal Delivery. Nasal or nasopharyngeal delivery of the polypeptide (or derivative) is
also contemplated. Nasal delivery allows the passage of the polypeptide directly over the
upper respiratory tract mucosa after administering the therapeutic product to the nose,
without the necessity for deposition of the product in the lung. Formulations for nasal
delivery include those with dextran or cyclodextran.

The following examples are presented in order to more fully illustrate the preferred
embodiments of the invention. They should in no way be construed, however, as
limiting the broad scope of the invention.

EXPERIMENTAL DETAILS SECTION

EXAMPLE 1: Peptide truncates of choline binding protein A (CbpA) 

A polypeptide comprising a truncated N-terminal fragment of the CbpA (serotype 4) was
generated. Full length CbpA was amplified with PCR primers SJ533 and SJ537; the
primers were designed based on the derived N-terminal amino acid sequence of the CbpA
polypeptide. 5' forward primer SJ533 = 5' GGC GGA TCC ATG GA(A,G) AA(C,T)
GA(A,G) GG 3'. This degenerate primer, designed from the amino acid sequence
XENEG, incorporates both BamHI and NcoI restriction sites and an ATG start codon.
3' reverse primer SJ537 = 5' GCC GTC GAC TTA GTT TAC CCA TTC ACC ATT
GGC 3'. This primer incorporates a SalI restriction site for cloning purposes, and the
natural stop codon from CbpA, and is based on both type 4 and R6x sequence.
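
The primer features described above can be checked mechanically. The following is a minimal sketch, not part of the original protocol: it rewrites the degenerate positions of SJ533 in IUPAC notation (R for A or G, Y for C or T), expands every variant, and looks for the standard BamHI (GGATCC), NcoI (CCATGG) and SalI (GTCGAC) recognition sequences. The function names are hypothetical.

```python
from itertools import product

# IUPAC codes needed here: R = A or G, Y = C or T.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Primers from the text, written 5' to 3' without spaces.
# GA(A,G) becomes GAR and AA(C,T) becomes AAY.
SJ533 = "GGCGGATCCATGGARAAYGARGG"
SJ537 = "GCCGTCGACTTAGTTTACCCATTCACCATTGGC"

SITES = {"BamHI": "GGATCC", "NcoI": "CCATGG", "SalI": "GTCGAC"}


def expand(primer):
    """List every non-degenerate sequence encoded by a degenerate primer."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in primer))]


def reverse_complement(seq):
    """Reverse complement of a non-degenerate DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def sites_in(seq):
    """Names of the recognition sites found in seq, sorted alphabetically."""
    return sorted(name for name, site in SITES.items() if site in seq)


forward_variants = expand(SJ533)
print(len(forward_variants), "forward primer variants")
print(sites_in(forward_variants[0]), sites_in(SJ537))

# On the coding strand the reverse primer reads ... AAC TAA GTC GAC GGC,
# i.e. the TAA stop codon followed directly by the SalI site.
print(reverse_complement(SJ537))
```

Three two-fold degenerate positions give eight forward primer variants, each carrying the BamHI and NcoI sites ahead of the ATG start codon.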

PCR product was generated from genomic DNA as a template with primers SJ533 and
SJ537, amplified 30 cycles with an annealing temperature of 50°C using High Fidelity
enzyme (Boehringer Mannheim). The resulting PCR products were purified using
QIAquick PCR Purification Kit (Qiagen, Inc.), then digested with BamHI and SalI
restriction enzymes and cloned into the pQE30 expression vector (Qiagen, Inc.) digested
with BamHI, XbaI, and SmaI restriction enzymes.


Polypeptide R2:


The naturally occurring PvuII site at the end of the second repeat region, namely the C
region as shown in Figure 1 (nucleic acid 1228 of Type 4 sequence), was exploited to
create a truncated version of the cbpA gene, containing only the 5' portion of the gene.
To create the truncate clone, the full length clone PMI580 (Type 4) or PMI581 (R6x) was
digested with PvuII and XbaI; the resulting fragment was ligated into the expression
vector pQE30 and transformed into the appropriate host. The protein was expressed and
purified. In this instance the stop codon utilized by the expression vector is downstream
of the insert, so the protein expressed is larger than the predicted size of the insert due to
additional nucleic acids at the 3' end of the cloning site. The amino acid sequence of
polypeptide R2 is set forth in SEQ ID NO 1.

Polypeptide R1:

A similar strategy was used to express only the first repeat region within the N-terminal
region of CbpA, namely the A region of polypeptide R1. Here the naturally occurring
XmnI site between the two amino repeats (nucleic acid 856 of the Type 4 sequence) was
utilized. cbpA full length clone PMI580 was digested with XmnI and AatII. The vector
pQE30 was digested with AatII and SmaI. Once again the two sized fragments were
ligated, transformed into E. coli and clones screened for insert. One positive clone was
selected and recombinant protein purified from this strain.

All polypeptides were expressed and purified with the Qia Expression System (Qiagen)
using the E. coli pQE30 vector. The amino terminus of the His-tagged proteins was
detected by Western analysis using both anti-histidine antibodies and protein
specific antibodies.


Purification of R1 and R2:

To induce production of and purify recombinant proteins from E. coli, a single colony was
selected from plated bacteria containing the recombinant plasmid and grown overnight
at 37°C in 6.0 ml LB buffer with 50 µg/ml kanamycin and 100 µg/ml ampicillin. This 6.0
ml culture was added to 1L LB with antibiotics at the above concentrations. The culture was
shaken at 37°C until A600 = ~0.400. 1M IPTG was added to the 1L culture to a final
concentration of 1mM. The culture was then shaken at 37°C for 3-4 hrs. The 1L culture
was spun for 15 min at 4000 rpm in a model J-6B centrifuge. The supernatant was
discarded and the pellet stored at -20°C.
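
The induction step above is a simple dilution (C1 x V1 = C2 x V2). The following is a minimal sketch of that arithmetic only; the function name and the choice to ignore the small volume added by the stock are assumptions made for illustration and are not part of the described procedure.

```python
def stock_volume_ml(stock_molar: float, final_molar: float, final_volume_ml: float) -> float:
    """Volume of stock (ml) needed to reach final_molar in final_volume_ml.

    Uses C1 * V1 = C2 * V2 and ignores the small volume added by the stock itself.
    """
    if stock_molar <= 0 or final_molar <= 0 or final_volume_ml <= 0:
        raise ValueError("concentrations and volume must be positive")
    if final_molar > stock_molar:
        raise ValueError("final concentration cannot exceed the stock concentration")
    return final_molar * final_volume_ml / stock_molar


# 1 M IPTG stock, 1 mM final concentration, 1 L (1000 ml) culture
iptg_ml = stock_volume_ml(stock_molar=1.0, final_molar=1e-3, final_volume_ml=1000.0)
print(f"Add {iptg_ml:.1f} ml of 1 M IPTG to the 1 L culture")
```

Under these assumptions, about 1 ml of the 1 M stock is added per litre of culture.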

The 1L pellet was re-suspended in 25 ml 50 mM NaH2PO4, 10mM Tris, 6M GuCl,
300mM NaCl, pH 8.0 (Buffer A). This mixture was rotated at room temperature for 30
minutes and sonicated on a VibraCell Sonicator (Sonics and Materials, Inc., Danbury,
CT) using the micro tip, two times, for 30 secs, at 50% Duty Cycle and with the output
setting at 7. The mixture was spun 5 min at 10K in a JA20 rotor, the supernatant was
removed and the pellet discarded. The supernatant was loaded onto a 10 ml Talon (Clontech, Palo
Alto, CA) resin column attached to a GradiFrac System (Pharmacia Biotech, Uppsala,
Sweden). The column was equilibrated with 100 ml Buffer A and washed with an
additional 200 ml of this buffer. A volume based pH gradient using 100% 50 mM
NaH2PO4, 8M Urea, 20mM MES, pH 6.0 (Buffer B) as the final target buffer was run
over a total volume of 100 ml. Protein eluted at ~30% Buffer B. Eluted peaks were
collected and pooled.

For refolding, dialysis was carried out with a 2L volume of PBS at room temperature for 
approximately 3 hrs using dialysis tubing with a molecular weight cutoff of 14,000. The 
sample was then dialyzed overnight in 2L of PBS at 4°C. Additional buffer exchange 
was accomplished during the concentration of the protein using Centriprep-30 spin 
columns by adding PBS to the spun retentate and re-spinning. The protein concentration
was determined using the BCA protein assay and the purity visualized using a Coomassie 
stained 4-20% SDS-PAGE gel (Figure 3). 

EXAMPLE 2: Lectin activity of polypeptides R1 and R2

LNnT is a carbohydrate analog of the receptors for pneumococci present on eukaryotic
cells. It has been shown that a CbpA defective pneumococcal mutant failed to adhere to
either eukaryotic cells or immobilized sugar, indicating that CbpA is the adhesive ligand.
CbpA is a modular protein that can be divided into two regions: the N-terminal
functional domain and the C-terminal choline binding domain (Figure 1). Polypeptides
R1 and R2 were analyzed for biological activity to determine if the activities of the entire
CbpA were localized in the unique N terminus (modelled by R2) or a fragment thereof
(modelled by R1). It was determined whether or not the N-terminal domain alone (R2)
contained the lectin binding biological activity in the absence of the choline binding
domain (CBD). This was tested using the full length CbpA and polypeptide R2 (truncate
missing the CBD region beyond the PvuII site in the proline rich region).

The assay was to coat tissue culture wells with glycoconjugates known to be recognized
by CbpA: LNnT-albumin, 3' sialyl lactose-albumin, and the negative control albumin.
The plates were then blocked with albumin and washed, and either full length CbpA,
polypeptide R2, or polypeptide R1 was added for 15 minutes (0.8 µg/ml). Then, without
washing, fluorescein labelled R6 pneumococci were added for 30 minutes, the wells were
washed, and adherent bacteria were counted visually.

Binding of R6 to carbohydrate without any peptide addition was the positive control and
was calibrated at 100% (Table 1). In three separate experiments, full length CbpA or
polypeptide R2 competitively inhibited binding of pneumococci to LNnT coated
surfaces. CbpA inhibited binding to 63%, 71% and 64% of control, and R2
inhibited to 65%, 53% and 74% of control. The equivalent activity of CbpA and R2
indicates the choline binding domain is not necessary for LNnT lectin activity of CbpA,
and that R2 is a candidate LNnT lectin.

In contrast to binding to LNnT, binding of pneumococci to 3' sialyl lactose was not
inhibited by R2 (79 and 101%) compared to the full length CbpA (66 and 74%). This
indicates that the sialic acid recognition activity is lost when the CBD is missing. In
contrast, R1 seems to be active in recognition of sialic acid, a property shared with full
length CbpA but apparently masked in R2. This indicates that folding of polypeptide into functional
domains is influenced by the composition and length of the polypeptide. Slight sequence
variation is found in other strains (see Figure 2). Given the high degree of homology of
sequence between R1 and R2, it is possible that both R1 and R2 are needed for
lectin activity or that they are both lectins with slightly different specificities (± sialic
acid).
Table 1: Inhibition of Binding of R6 pneumococci to purified glycoconjugate
by soluble forms of CbpA

                     LNnT                              3' sialyl lactose
Cbp form             # pneumococci     % control      # pneumococci     % control
                     per monolayer                    per monolayer
                     (SD)

No peptide           3282              100%           2611              100%
                     2421 (489)                       2115 (125)
                     2210 (350)

Full length CbpA     2075              63, 71, 64     1933              74
                     1740 (167)                       1405 (240)        66
                     1415 (50)

Polypeptide R2       2461              74, 53, 65     2639              101
                     1288 (672)                       1670 (420)        79
                     1440 (530)

Polypeptide R1       3002              91, 92, 112    1052              40
                     2245 (182)                       1445 (526)        68
                     2500 (310)

N=3 experiments LNnT, each 3 wells
N=2 experiments SiL, each 3 wells
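
The "% control" columns of Table 1 follow from dividing each count by the "No peptide" count of the same experiment. The sketch below recomputes the LNnT column from the tabulated counts; the function and variable names are illustrative assumptions, and a recomputed value may differ from the tabulated one by a point because the rounding used in the table is not stated.

```python
def percent_of_control(treated: float, control: float) -> float:
    """Adherent bacteria in a treated well as a percentage of the untreated control."""
    if control <= 0:
        raise ValueError("control count must be positive")
    return 100.0 * treated / control


# LNnT counts from Table 1, one value per experiment (three experiments)
lnnt_control = [3282, 2421, 2210]
lnnt_counts = {
    "Full length CbpA": [2075, 1740, 1415],
    "Polypeptide R2": [2461, 1288, 1440],
    "Polypeptide R1": [3002, 2245, 2500],
}

lnnt_percent = {
    form: [percent_of_control(t, c) for t, c in zip(counts, lnnt_control)]
    for form, counts in lnnt_counts.items()
}

for form, pcts in lnnt_percent.items():
    print(form, ", ".join(f"{p:.1f}%" for p in pcts))
```

Pairing each treated count with the control from the same experiment, rather than with a pooled control, is what reproduces the per-experiment percentages listed in the table.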


Lectin activity correlates with cell binding activity

Human cells bear surface molecules that contain carbohydrates (glycoprotein and
glycolipid), and bacteria bind to these glycoconjugates by the carbohydrate despite very
different protein or lipid backbones. Thus, bacteria bearing polypeptide with lectin
activity in vitro can adhere to human cell surfaces. This direct correlation between in
vitro lectin activity and cell binding action is known for pneumococci. For example,
LNnT competitively inhibits binding of pneumococci to TNF activated A549 human
lung cells and blocks the progression of pneumonia in vivo. To establish that the lectin
activity of truncates of CbpA reflects cell binding activity, CbpA and truncates were
tested for inhibition of binding of pneumococci to lung cells (Table 2). Full length
CbpA and polypeptide R2 competitively inhibited adherence of pneumococci to lung
cells to 58% and 63% of controls respectively. Polypeptide R1 was not effective,
indicating the LNnT binding activity of R2 is needed for and explains pneumococcal
binding to lung cells.



LNnT Lectin activity is dependent on R2

The N-terminal region of CbpA contains two repeats of ~110 amino acids each (see
Figure 1, regions A and C within polypeptide R2). To study the relative contribution of
the two domains to bio-activity, R1, containing only domain A, was compared to
full length CbpA. When tested in the adherence assay, polypeptide R1 did not inhibit
adherence to LNnT at all (91, 92, and 112% of wild type). However, polypeptide R1
demonstrated some inhibition of binding to sialyl lactose (68 and 40% of control). This
demonstrates that the polypeptide R2 is required for LNnT lectin activity and R2 is a
candidate LNnT lectin domain. In contrast, R1 seems to be active in recognition of sialic
acid.

Antibodies to N-terminal Domain of CbpA Block Cell Binding:

Given that the N-terminal domain of CbpA binds cells, interference with the N-terminal
domain activity will prevent or reverse bacterial binding to cells or purified
glycoconjugates. One such mechanism of interference is antibody.


Table 3: Inhibition of binding of R6 pneumococci to LNnT coated surfaces by
anti-CbpA R2 antibodies

                             # pneumococci per       % control (mean)
                             monolayer (SD)

Preimmune Antibody           198 (64); 88 (4)        100%
Antibody to Truncate R2      56 (11); 9 (2)          28%; 10%

5 µl of undiluted rabbit antibody + 5 µl of 2 x 10^7 R6x were preincubated at RT for 30 min, then
added to LNnT coated wells for adherence assay. Two independent experiments are shown.
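
The values in Table 3 can also be read as percent inhibition, that is, 100% minus the percent of control. The following sketch recomputes this from the tabulated mean counts; the function name is an assumption for illustration and no statistical test is implied.

```python
def percent_inhibition(treated_count: float, control_count: float) -> float:
    """Percent reduction in adherent bacteria relative to the control well."""
    if control_count <= 0:
        raise ValueError("control count must be positive")
    return 100.0 * (1.0 - treated_count / control_count)


# (preimmune mean count, anti-R2 mean count) for the two experiments in Table 3
experiments = [(198, 56), (88, 9)]

inhibition = [percent_inhibition(anti_r2, preimmune) for preimmune, anti_r2 in experiments]
for number, value in enumerate(inhibition, start=1):
    print(f"Experiment {number}: {value:.0f}% inhibition")
```

Both experiments give more than 70% inhibition, consistent with the 28% and 10% of control shown in the table.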


Antisera raised to the recombinant N-terminal domain of CbpA (R2) were tested for the
ability to block adherence of pneumococci to LNnT. Rabbit polyclonal anti-CbpA
antisera (5 µl) plus 5 µl of 2 x 10^7 labeled bacteria were incubated at room temperature
for 30 min. This mixture was overlaid onto immobilized LNnT for 30 min, and then
washed 3 times with PBS to remove unbound bacteria. Bacteria bound to the plates were
enumerated microscopically and results are presented as the mean values plus the
standard deviation from six wells. Results shown in Table 3 demonstrate that antisera
raised against the R2 polypeptide blocked the binding of pneumococci to LNnT. Figure
5 demonstrates a titration curve of preimmune versus anti-CbpA R2 antibody for
inhibition of binding of pneumococci R6x to the model receptor LNnT. Greater than
70% of pneumococcal adherence was blocked by anti-R2 at dilutions of 1:100 and 1:200.
Further dilution to 1:400 eliminated activity, indicating the specificity of the effect.

The antisera shown in Table 3 and Figure 5 were raised against
CbpA from serotype 4. The R6x strain pneumococci used in the inhibition of adherence
assay was derived from serotype 2. The ability of the antibody to block adherence of a
heterologous serotype of bacteria indicates cross protective activity across serotypes.
Such activity is highly desired for an effective vaccine immunogen.

Activity of antibodies to native conformation of N terminus of CbpA:

CbpA can be purified over a choline affinity column from its natural host, the
pneumococcus, as described by Rosenow et al. Alternatively, a polyhistidine tag can be
engineered onto the end of the gene such that the expressed protein is extended by
several histidine residues. These residues facilitate purification over a nickel affinity
matrix. Purification of full length polypeptides as opposed to shorter truncates favors
retention of the native tertiary structure. CbpA purified especially from pneumococcus
but also from E. coli or other host bacteria by these biochemical means retains its native
tertiary structure. Used as an immunogen, natively folded CbpA engenders antibodies
that potentially differ from those elicited by immunization with a truncate which may
fold differently. Similarly, CbpA used as a therapeutic may have tertiary structure
differing from the truncate which would improve its ability to block adherence. Given
these considerations, it may be advantageous to produce CbpA as full length protein,
allowing it to fold to its native tertiary structure, and then cleave the C terminal (CBD)
away biochemically. For example, treatment with hydroxylamine will cleave CbpA at
amino acid position 475 of serotype R6x and of serotype 4 of choline binding protein
A, separating the N and C termini. The N terminal fragment is then suitable as a
therapeutic or an immunogen.
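
The text gives the cleavage position but not the chemistry. Hydroxylamine is generally used to cleave Asn-Gly peptide bonds, so, assuming that specificity applies here, candidate cleavage sites can be located by scanning a one-letter sequence for "NG". The sketch below does this on a short hypothetical sequence; the sequence, function names and fragment handling are illustrative assumptions and are not taken from CbpA or from the described method.

```python
from typing import List


def asn_gly_sites(sequence: str) -> List[int]:
    """Return 1-based positions of Asn residues that are directly followed by Gly."""
    seq = sequence.upper()
    return [i + 1 for i in range(len(seq) - 1) if seq[i] == "N" and seq[i + 1] == "G"]


def cleave_after(sequence: str, positions: List[int]) -> List[str]:
    """Split a sequence after each listed 1-based position."""
    fragments, start = [], 0
    for pos in sorted(positions):
        fragments.append(sequence[start:pos])
        start = pos
    fragments.append(sequence[start:])
    return fragments


# Hypothetical toy sequence (NOT CbpA), containing two Asn-Gly pairs
toy = "MKTENEGATNGSSDKLNGQ"
sites = asn_gly_sites(toy)
fragments = cleave_after(toy, sites)
print(sites)
print(fragments)
```

Applied to a full length sequence, the same scan would show whether a single Asn-Gly pair lies at the boundary between the N-terminal region and the CBD, which is the situation the text describes for position 475.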

Alternatively, native CbpA can be used as an immunogen to raise antisera to the active
structure. The bioactive anti-N terminal antibodies in this mixture can be enriched by
removing antibodies to the CBD by absorption. Such an antibody was prepared by
incubating 200 µl serum with 1 x 10^8 CbpA-defective bacteria for 1 hour at RT. The
other choline binding proteins on this mutant absorb out anti-CBD antibodies, which are
then removed from the antiserum by centrifuging and removing the bacteria.

To demonstrate the bioactivity of absorbed anti-CbpA antibodies, the ability of the
absorbed antiserum to block pneumococcal adherence to the model receptor LNnT was
determined. R6x pneumococci were incubated with a 1:600 dilution of antiserum and
then added to wells coated with LNnT albumin.

Antisera tested: preimmune antiserum; anti-CbpA antiserum; anti-CbpA antiserum absorbed
to remove CBD antibodies.

Number of pneumococci per well ± SD (% of control): 563 ± 11 (100%); 479 ± 11 (85%);
294 ± 72 (52%); 175 ± 38 (31%).

The results indicate that antibodies to the N terminal domain of CbpA in its native
conformation strongly block adherence. This activity is greater than that of the anti-R2
antibody, which was inactive at a 1:600 dilution. Further demonstration of this activity
of absorbed anti-CbpA antiserum is shown by the titration study of Figure 5. Baseline
adherence of pneumococci Type 4 to LNnT coated wells is shown by the triangles.
Preincubation of pneumococci with unabsorbed (squares) or absorbed (diamonds)
antiserum at the various dilutions indicated yielded decreased adherence. The fact that
both antisera showed similar decreases in adherence demonstrates that the majority of
the anti-adherence activity of the antibody is directed to the N-terminus (i.e., removal of
antibodies to the choline binding domain by absorption does not decrease bioactivity).


EXAMPLE 3: Passive Protection With Anti-R2 Antiserum

Generation of Rabbit Immune Sera:

Rabbit immune sera against polypeptide R2 (CbpA truncate) and CbpA were generated.
Following collection of preimmune serum, a rabbit
was immunized with 250 µg R2 containing both amino terminal repeats
(preparation 4g3:5 8 above), in Complete Freund's Adjuvant. The rabbit was given a
boost of 125 µg R2 in Incomplete Freund's Adjuvant on day 21 and bled on day 31. A
second rabbit was similarly immunized with purified CbpA.


Passive Protection in Mice:

C3H/HeJ mice (5/group) were passively immunized intraperitoneally with 100 µl of
a 1:2 dilution of rabbit anti-R2 or preimmune sera in sterile PBS (pre-immune and day
31 immune sera). One hour after administration of serum, mice were challenged with
1600 CFU Streptococcus pneumoniae serotype 6B (strain SP317). Mice were monitored
for 14 days for survival. Eighty percent of the mice immunized with rabbit immune
serum raised against polypeptide R2 survived challenge (Figure 4). All mice immunized
with pre-immune rabbit serum were dead by day 7.

These data demonstrate that antibodies specific for CbpA are protective against
systemic pneumococcal infection. The data further indicate that the choline-binding
region is not necessary for protection, as antibody specific for the truncated protein
polypeptide R2, lacking the conserved choline binding repeats, was sufficient for
protection. In addition, serum directed to CbpA of serotype 4 was protective against
challenge with serotype 6B.


EXAMPLE 4: Active Protection With Anti-R1 Antiserum

C3H/HeJ mice (10/group) were immunized intraperitoneally with CbpA truncate
protein R1 (15 µg in 50 µl PBS, plus 50 µl Complete Freund's Adjuvant). A group
of 10 sham immunized mice received PBS and adjuvant. A second immunization was
administered four weeks later, 15 µg protein i.p. with Incomplete Freund's Adjuvant
(sham received PBS plus IFA). Blood was drawn (retro-orbital bleed) at several time
points through week 9 (including weeks 3 and 9) for analysis of immune response. The
ELISA end point anti-CbpA truncate titer
of pooled sera from the 10 CbpA immunized mice at 9 weeks was 4,096,000. No
antibody was detected in sera from sham immunized mice. Mice were challenged at
week 10 with 560 CFU Streptococcus pneumoniae serotype 6B (strain SPSJ2p,
provided by P. Flynn, St. Jude Children's Research Hospital, Memphis, TN). Mice
were monitored for 14 days for survival. Eighty percent of the mice immunized with
R1 survived challenge.

These data demonstrate that immunization with a recombinant fragment of
CbpA elicits production of specific antibodies capable of protecting against systemic
pneumococcal infection and death. The data further indicate that the choline binding
region is not necessary for protection, as the immunogen is the truncated protein R1.
Additionally, the results suggest that a single amino terminal repeat may be sufficient to
elicit a protective response. Cross protection is also demonstrated, as the
recombinant pneumococcal protein was generated based on serotype 4 DNA sequence
and protection was observed following challenge with a serotype 6B isolate.

EXAMPLE 5: Prophylaxis against nasopharyngeal colonization in the infant rat

To assess the effect of the N-terminal domain of CbpA on pneumococcal colonization,
infant rats were administered truncate peptides, then challenged with pneumococci, and
colonization of the nasopharynx was evaluated.

Animals were treated intranasally with 10 µl of PBS or of truncate peptide. Type 3
pneumococci were subsequently introduced intranasally. To determine the ability of the
polypeptide to competitively inhibit pneumococcal adherence and colonization, nasal
washing was performed. Control animals receiving PBS showed 2200 to 8700
(mean 6075) colonies per 10 µl. Animals treated with truncate R2 showed the greatest
decrease (3600, 3500, 2500, 2100), to a mean of 2925 bacteria per 10 µl (48% of control).
Animals treated with truncate R1 also showed decreased colonization (5000, 4800, 3500,
1600), to a mean of 3725 (61% of control).
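
The group means and percentages quoted above can be checked directly from the listed colony counts. The sketch below does so using the reported control mean of 6075; the individual control counts are not fully legible in the source, so only the reported mean is used, and all names are illustrative assumptions.

```python
from statistics import mean

CONTROL_MEAN = 6075  # reported mean colonies per 10 µl for control animals

# Colony counts per 10 µl of nasal wash, as listed in the text
treated_groups = {
    "R2": [3600, 3500, 2500, 2100],
    "R1": [5000, 4800, 3500, 1600],
}

summary = {}
for name, counts in treated_groups.items():
    group_mean = mean(counts)
    summary[name] = (group_mean, 100.0 * group_mean / CONTROL_MEAN)

for name, (group_mean, pct) in summary.items():
    print(f"{name}: mean {group_mean:.0f} colonies per 10 µl, {pct:.0f}% of control")
```

The recomputed means (2925 and 3725) and percentages (48% and 61%) agree with the values stated in the example.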

,!:Zr: 7 ™ ° f ^ <«• totan, invention 

pneumococcal .J^T ^ ^ ^ ^ « 


30 
Discussion:

As demonstrated by the experiments, polypeptide R2 when: 1) administered as a vaccine
antigen elicits protective antibodies and is a preferred composition for a vaccine
formulation; and 2) delivered as a peptide to the respiratory tract and/or nasopharynx
receptor, competitively prevents pneumococcal attachment and is a preferred
composition for a prophylactic and therapeutic agent against colonization or invasive
disease. Also, truncates of CbpA function as lectins without the CBD. Two
carbohydrates are recognized: LNnT by a peptide containing both N-terminal repeats (A
and C) in Figure 1 and sialic acid by a peptide containing only the single most
N-terminal repeat (A). The truncates containing the N-terminal repeats, polypeptides R1 and
R2, demonstrate lectin activity in cell culture assays as well.

Important features of polypeptide R2 activity include: 1) complete correlation of
bioactivity of polypeptide R2 and full length CbpA for recognition of purified
glycoconjugate receptor analogs, lung cells and animal models. Correlation is also
demonstrated for antibodies to them; and 2) cross protection between type 4 derived
agents and bacteria in in vitro assays using other serotypes (e.g. 6B and 2), which is
important for useful vaccine, prophylactic and therapeutic modalities.

While the invention has been described and illustrated herein by references to various
specific material, procedures and examples, it is understood that the invention is not
restricted to the particular material, combinations of material, and procedures selected
for that purpose. Numerous variations of such details can be implied as will be
appreciated by those skilled in the art. Likewise, any references cited herein are, to the
extent relevant to the disclosure of the invention, to be considered as incorporated herein
by reference.
WHAT IS CLAIMED IS:

1. An isolated polypeptide comprising an amino acid sequence of a N-terminal
choline binding protein A truncate.

2. The isolated polypeptide of claim 1, wherein the amino acid sequence is set forth
in SEQ ID NO 1, including fragments, mutants, variants, analogs, or
derivatives, thereof.

3. The isolated polypeptide of claim 1, wherein the amino acid sequence is set forth
in SEQ ID NO 3, including fragments, mutants, variants, analogs, or
derivatives, thereof.

4. The isolated polypeptide of claim 1, wherein the amino acid sequence is set forth
in SEQ ID NO 6.

5. The isolated polypeptide of claim 1, wherein the amino acid sequence is set forth
in SEQ ID NO 7, including fragments, mutants, variants, analogs, or
derivatives, thereof.

6. The isolated polypeptide of claim 1, wherein the amino acid sequence is set forth
in SEQ ID NO 9, including fragments, mutants, variants, analogs, or
derivatives, thereof.

7. An isolated polypeptide comprising an amino acid sequence of a N-terminal
choline binding protein A truncate having the amino acid as set forth in SEQ ID
NO 24, wherein the polypeptide exhibits its tertiary structure.

8. The isolated polypeptide of claim 7, wherein the tertiary structure corresponds
to that present in the native protein.

WO 99/51187

PCT/US99/07668

9. The isolated polypeptide of claim 7, wherein the polypeptide is made by
cleaving a full length choline binding protein A with hydroxylamine, wherein the
hydroxylamine cleaves the choline binding protein A at amino acid 475 thereby
creating the N-terminal choline binding protein A truncate.

10. An isolated analog of the polypeptide of claim 1.

11. An isolated polypeptide according to claim 10, wherein the analog comprises the
amino acid sequence having an N-terminal methionine or an N-terminal
polyhistidine.

12. The isolated polypeptide according to claim 1, wherein said fragments are
proteolytic digestion products of the polypeptide.

13. An isolated polypeptide comprising an amino acid sequence of a N-terminal
choline binding protein A truncate, wherein the polypeptide has lectin activity
and does not bind to choline.

14. An isolated immunogenic polypeptide comprising an amino acid sequence of a
N-terminal choline binding protein A truncate.

15. The immunogenic polypeptide of claim 14, wherein the amino acid sequence is
set forth in SEQ ID NO 1, including fragments, mutants, variants, analogs, or
derivatives, thereof.

16. The immunogenic polypeptide of claim 14, wherein the amino acid sequence is
set forth in SEQ ID NO 3, including fragments, mutants, variants, analogs, or
derivatives, thereof.

17. The immunogenic polypeptide of claim 14, wherein the amino acid sequence is
set forth in SEQ ID NO 7, including fragments, mutants, variants, analogs, or
derivatives, thereof.



18. The immunogenic polypeptide of claim 14, wherein the amino acid sequence is
set forth in SEQ ID NO 9, including fragments, mutants, variants, analogs, or
derivatives, thereof.

19. An isolated nucleic acid encoding a polypeptide comprising an amino acid
sequence of a N-terminal choline binding protein A truncate.

20. The isolated nucleic acid of claim 19, wherein the nucleic acid is set forth in
SEQ ID NO 12, including fragments, mutants, variants, analogs, or derivatives,
thereof.

21. The isolated nucleic acid of claim 19, wherein the nucleic acid is set forth in
SEQ ID NO 14, including fragments, mutants, variants, analogs, or derivatives,
thereof.

22. The isolated nucleic acid of claim 19, wherein the nucleic acid is set forth in
SEQ ID NO 17, including fragments, mutants, variants, analogs, or derivatives,
thereof.

23. The isolated nucleic acid of claim 19, wherein the nucleic acid is set forth in
SEQ ID NO 19, including fragments, mutants, variants, analogs, or derivatives,
thereof.

24. The isolated nucleic acid of claim 19, wherein the nucleic acid is DNA.

25. The isolated nucleic acid of claim 19, wherein the nucleic acid is cDNA.

26. The isolated nucleic acid of claim 19, wherein the nucleic acid is genomic DNA.

27. The isolated nucleic acid of claim 19, wherein the nucleic acid is RNA.

28. An isolated nucleic acid of claim 19 operatively linked to a promoter of RNA
transcription.

29. A vector which comprises the nucleic acid molecule of claim 19.

30. The vector of claim 29, wherein the promoter comprises a bacterial, yeast, insect 
or mammalian promoter. 

31. The vector of claim 30, wherein the vector is a plasmid, cosmid, yeast artificial 
chromosome (YAC), bacteriophage or eukaryotic viral DNA. 

32. A host vector system for the production of a polypeptide which comprises the 
vector of claim 30 in a suitable host cell. 

33. The host vector system of claim 32, wherein the suitable host cell comprises a 
prokaryotic or eukaryotic cell. 

34. A cell line comprising the nucleic acid of claim 19. 

35. A method of obtaining a polypeptide in purified form which comprises: 

(a) introducing the vector of claim 19 into a suitable host cell; 

(b) culturing the resulting host cell so as to produce the polypeptide; 

(c) recovering the polypeptide produced in step (b); and 

(d) purifying the polypeptide so recovered in step (c). 

36. An antibody capable of specifically binding to the polypeptide of claim 1 or 7. 

37. The antibody of claim 36, wherein the antibody is a monoclonal antibody. 

38. The antibody of claim 36, wherein the antibody is a polyclonal antibody. 

39. The antibody of claim 36, wherein the antibody is a chimeric (bispecific) 
antibody. 
40. A pharmaceutical composition comprising an amount of the polypeptide of claim
1 and a pharmaceutically acceptable carrier or diluent.

41. A method of inducing an immune response in a subject which has been exposed
to or infected with a pneumococcal bacterium comprising administering to the 
subject an amount of the pharmaceutical composition of claim 40, thereby 
inducing an immune response. 


42. A method for preventing infection by a pneumococcal bacterium in a subject 
comprising administering to the subject an amount of the pharmaceutical 
composition of claim 40 effective to prevent pneumococcal bacterium 
attachment, thereby preventing infection by a pneumococcal bacterium. 

43. The method of claim 42, wherein the pharmaceutical composition is delivered to 
the respiratory tract or nasopharynx. 

44. A method for preventing infection by a pneumococcal bacterium in a subject 
comprising administering to the subject an amount of a pharmaceutical 
composition comprising the antibody of claim 36 and a pharmaceutically 

acceptable carrier or diluent, thereby preventing infection by a pneumococcal 
bacterium. 

45. A vaccine which comprises the polypeptide of Claim 1 and a pharmaceutically 
acceptable adjuvant or carrier. 

46. The vaccine of Claim 45, wherein the polypeptide has the amino acid sequence 
as set forth in any of SEQ ID NOS: 1, 3-7, 9-1 1, 22, and 23. 

47. The vaccine of Claim 45 wherein the polypeptide comprises an amino acid 

sequence of a N-terminal choline binding protein A truncate as set forth in 
Figure 2. 
48. A vaccine which comprises the polypeptide having the amino acid sequence
which comprises a conserved region as set forth in Figure 2 and a 
pharmaceutically acceptable adjuvant or carrier. 

49. The vaccine of Claim 48 wherein the conserved region is selected from amino 
acid sequence 158 to 172; 300 to 321; 331 to 339; 355 to 365, 367 to 374, 379 
to 389; 409 to 427; and 430 to 447. 

50. A vaccine comprising an isolated nucleic acid encoding the polypeptide of Claim 
1 and a pharmaceutically acceptable adjuvant or carrier.

51. A vaccine comprising an isolated nucleic acid of Claim 19 and a 
pharmaceutically acceptable adjuvant or carrier. 

52. A vaccine comprising the vector of Claim 29 and a pharmaceutically acceptable 
adjuvant or carrier. 

53. A method for treating a subject infected with or exposed to pneumococcal 
bacterium comprising administering to the subject a therapeutically effective 
amount of the vaccine of any of Claims 45-51 or 52, thereby treating the subject. 
Figure 1 [schematic; recoverable labels: A, B, C-terminus, Choline Binding Domain, CbpA, R2, R1]
-AVASLVMGSVVHATB'rEVTrOVATSSNRANESOTEHRKAAKO O— L 

10 20 30 40 SO 60 70 

SPB328 (23F) Cbp -AVASLfWGSWHATEKEVTrQVATSSNKANKSQTEHMKAAKQ VDEYIKKKL O— L 

SPB365 (23F) Cbp -AVASLF1CSWHATEKEVTTQVATSSNRANKSQTEHMKAAKQ VDEYIKKKL- O--L 

SPB10S ( 6B ) CbpA VAVASLVMGSVVHATEKEVTTQVATSSNRANESQAGHRKAAEQF DEYIXTMI— Q— L 

SPSJ12 < 19 A) Cfc*> -AVASLVJESWHATENEGTTQAPTSSNRGNESQAEHMKAAKQ VDEYIEKML- - -Q T. 

SPB33 1 < 14 ) CbpA — VASLFW3SWHATEKEVTIQVATSSNKANKSQTEHMKAAKQ VDEYIKKKL Q L 

SPR332 (9V) CbpA CTVASLVMG$WHA.TENERTIQVPTSSIIRGK PERRKAABQF DEYINKKI O L 

ATCC2 CfcpA trun VAVASLVMGSVVHATEKEVTTQVPTYSNMAK TEHRKAAKQV VDEYIEKMLHEIO— L 

R6X(2) CbpA tru ENEGSTQAATSSNMAK TEHRKAAKQV VDSYIEKMLREIO— L 

SPSJ9 (14) CbpA Y-XASLFLGGWHAE GVRSENNPTVTSSGQDISKKYADE VKSHLERILSEIOTNL 

ATCC6B Cbpa tru ASLFLGGWHAE GVRSGNNSTVTSSGQDXSKKYADE VSSHLQSILKDVNKNL 

Ntype4 CbpA tru CtVASLVtGSWHATENEGAT'QVPTSSNRANESQAEQGEQPKKLDSERDKARKEVEEYVKKXVGESYAXS 
ATCC4 CbpA trun -rVASLVMGSWHATENEGATQVPTSSNRANESQAEQGEQPKKLDSERDKARKEVEEYVKKIVGESYAKS 


PRRKH'ICINVMJTTKLSAXKTEYLNGLSVLEEKSEAELPSEIKA KLDAAFEOFKKDTL 

80 90 100 110 120 130 TTn 

SPB328 (23r) Cbp DRRKHTQNVGLLTKLGVIKTEYLHGLSVSraQCSEAELPSEIKA KLDAAFEOFKKDTL 

SPB365 (23F) Cbp DRRKHTQNVGLLTKLGVIKTEYLflGLSVSKKKSEAELPSElKA KLDAAFEQFKKDTL 

SPB105 ( 6B ) CfcpA DRRKHTQNFAMIKLSRIKTEYLRKLNVLEEKSKAELPSETKK EIDAAFEQFKKDTNR 

SPSJ12 { 19A) Cbp DRRKHTQNVGLLTfCjGAIKTEYLRGLSVSKEKSTAELPSEIKE KLTAA5XQFKKDIL 

SPB3 31(14) CbpA DRRKHTQNVGIXTKLGVIKTEYLHaiiSVSTCKKSEAELPSEXKA KLDAAFEOFKKDTL 

SPR332 ( 9V) CbpA DKRKHTQNLAFKIQLSRIKTEYLNGL KEKSEAELPSKIKA SLQAAFKQFKMDTL 

ATCC2 CbpA trun DRJRKHTQNFAFNMKLSAIKTEYLYGLK EKSEAELPSEVKA KLDAAFEQFKKDTL 

R6X ( 2 ) CbpA truDRRKHTQHUALNIKLSAlKIXYLRELNVLEEKSKDELPSElKA KLDAAFEKFKKDTL 

SPSJ9 ( 14 ) CbpA DRS®IKTVNL3MCQDIKRTYLYEI2TVLEDKSKA£LPSKIKA ELDAAFEQFKKDEL 

ATCC6B Cbpa tru KKVQHTONADFHKKLSKHCrKYLYELNVLEEKSEAELTSKTKETKEELTAAFEQFKKOTL 

CbpA tru TKKRHTITVALVKELNNIKNEi"LN — KIVESTSESQLQILMMESRSKVDEAVSKFEKDSSSSSSSDSSTK 
ATCC4 CbpA trun TTCKRHTITVALVNSLfJtJIJdEYlil — KH/ESTSESQLQItMMESRSKVDEAVSKFEKDSSSSSSSDSSTK 


SPB328 (23F) Cbp 
SPB365I23F) Cbp 
SPB10S(6B) CbpA 
SPSJ12 (19A) Cbp 
SPB33K14) CbpA 
SPR332 (9V) CbpA 
ATCC2 CfapA trun 
R6X ( 2 ) CbpA tru 
SPSJ9 (14) CfapA 
ATCC6B Cbpa tru 
Ncype4 CbpA tru 
ATCC4 CfapA trun 


"l50 
160 170 
“ -AHD^KEEDRKlC/PTITyKTI ^^TA^nv^/ 
180 


190 200 2io 
A^J^ KEICDL ^KYTlTyryKTXiELDXAESPVEV 
AXAQI^CT$p!NYPOTryTOTljELEIAESDVEV 
AEDQKEEimHICfPTITYKTLEL£IAESDVTV 
•AEt^KEKDLRNlfPTOTracT7LCT.nTaTy<?nmftr 

PTEPE200/AEAEKKVEEA030/AEAKKKAKAQKEEDHFlTYPTXTYXTrX)LEIAEFDVKV 

K ICEK VAEflEKK VA FA FX K AKA0KEEDRRNYPTE7T VTr px -ET -F T AESOTEV 

^K PGEKVAE?^KKAVl£ftKKK — -AEIX3KEHX)RFNYP^7ITKTLELEXAEFT3VKV 

PTEPGKXVAEAKKKVEEAEXK -AKAQKEEDYRNYPTITOKTLiELEXAESDVKV 
SPB328I23F1 Cbp 
SPB365 ( 2 3F) Cbp 
SPB105 ( 53) CbpA 
SPSJ12 ' 19A) Cbp 
SPB33U14? CbpA 
3PR332 { 9V) CbpA 
ATCC2 CbpA trun 
R6X(2 } CbpA tra 
SPSJ9(14) CbpA 
ATCC69 Cfcpa tru 
Glu Asn Glu Gly Ala Thr Gln Val Pro Thr Ser Ser Asn Arg Ala Asn

1 




5 


10 


15 





G iu Ser Gin Ala Glu Gin Gly Glu Gin Pro Lys Lys Leu Asp Ser Glu 


20 


25 


30 


Arg 


Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys He Val 


35 


40 


45 


Gly Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr He Thr Val 


50 


55 


60 


Ala Leu 
65 


Val Asn Glu Leu Asn Asn lie Lys Asn Glu Tyr Leu Asn Lys 

75 80 


70 


lie Val Glu Ser Thr Ser Glu Ser Gin Leu Gin lie Leu Met Met Glu 


85 


90 


95 


Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp Ser 

105 110 


100 


Thr Lys Pro Glu Ala Ser Asp 


Ser Ser Ser Ser Ser Ser Asp Ser Ser 

115 120 125 


Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala Glu 


130 


135 


140 


Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gin Lys 


145 


150 


155 


160 


Glu Glu Asp Arg Arg Asn Tyr Pro Thr XI. Thr Tyr Ays Thr L.u Glu 
165 


170 


175 


Leu Glu lie Ala Glu Ser 


Asp Val Glu Val Lys Lys Ala 


180 


185 


Glu Leu Glu 
190 


Leu Val Lys Val Lys Ala 


Asn Glu Pro Arg Asp Glu Gin 


195 


'200 


205 


Lys lie Lys 


Gin Ala Glu Ala Glu 


Val Glu Ser Lys Gin Ala 


210 


215 


Glu Ala Thr Arg Leu 
220 


225 230 235 

240 


Arg Ala Asp Ala Lys Glu 


245 


Gin Gly Lys Pro Lys Gly Arg Ala Lys Arg 


250 


255 


Gly Val Pro Gly Glu Leu Ala Thr Pro A.p Lys Lys Glu ^ ^ ^ 
260 


Lys Ser Ser Asp Ser Ser 


Val Gly Glu Glu Thr 


275 


280 


Leu Pro Ser Pro 
285 


Ser 


Leu Lys Pro Glu Lys Lys 
290 


Val Ala Glu Ala Glu 
295 


Lys Lys Val Glu Glu 
300 


Ala Lys Lys Lys' Ala 
305 


Glu Asp Gin Lys Glu Glu 


310 


315 


Asp Arg Arg Asn 


Tyr 

320 
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Pro Thr Asn Thr Tyr Lys Thr Leu 


325 


Glu Leu Glu He Ala Glu Ser Asp 


330 


335 


Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys 


340 


345 


350 


Glu Pro Arg Asn Glu Glu Lys Val Lys Gin Ala Lys Ala Glu Val Glu 


355 


360 


365 


Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys lie Lys Thr Asp Arg 
370 375 380 


Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys 
385 390 395 400 


Val Lys Glu Lys Pro Ala 
405 


<210> 2
<211> 655 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 2 

Glu Asn Glu Gly Ala Thr Gln Val Pro Thr Ser Ser Asn Arg Ala Asn
1 5 10 15

Glu Ser Gln Ala Glu Gln Gly Glu Gln Pro Lys Lys Leu Asp Ser Glu
20 25 30

Arg Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys Ile Val
35 40 45

Gly Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr Ile Thr Val
50 55 60

Ala Leu Val Asn Glu Leu Asn Asn Ile Lys Asn Glu Tyr Leu Asn Lys
65 70 75 80

Ile Val Glu Ser Thr Ser Glu Ser Gln Leu Gln Ile Leu Met Met Glu
85 90 95

Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp Ser
100 105 110

Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser Asp
115 120 125

Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala Glu
130 135 140

Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys
145 150 155 160

Glu Glu Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu
165 170 175

Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu
180 185 190

Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gln Lys Ile Lys
195 200 205

Gln Ala Glu Ala Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu
210 215 220

Lys Lys Ile Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lys Arg
225 230 235 240

Arg Ala Asp Ala Lys Glu Gln Gly Lys Pro Lys Gly Arg Ala Lys Arg
245 250 255

Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala
260 265 270

Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser
275 280 285

Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu
290 295 300

Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
305 310 315 320

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
325 330 335

Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
340 345 350

Glu Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu
355 360 365

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg
370 375 380

Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys
385 390 395 400

Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro Lys
405 410 415

Ala Glu Lys Pro Ala Pro Ala Pro Lys Pro Glu Asn Pro Ala Glu Gln
420 425 430

Pro Lys Ala Glu Lys Pro Ala Asp Gln Gln Ala Glu Glu Asp Tyr Ala
435 440 445

Arg Arg Ser Glu Glu Glu Tyr Asn Arg Leu Thr Gln Gln Gln Pro Pro
450 455 460

Lys Thr Glu Lys Pro Ala Gln Pro Ser Thr Pro Lys Thr Gly Trp Lys
465 470 475 480

Gln Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala
485 490 495

Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu Asn Ser Asn
500 505 510

Gly Ala Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr
515 520 525

Leu Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gln Asn Asn Gly
530 535 540

Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Leu
545 550 555 560

Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser Met Ala
565 570 575

Thr Gly Trp Leu Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn
580 585 590

Gly Asp Met Ala Thr Gly Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr
595 600 605

Leu Glu Ala Ser Gly Ala Met Lys Ala Ser Gln Trp Phe Lys Val Ser
610 615 620

Asp Lys Trp Tyr Tyr Val Asn Gly Ser Gly Ala Leu Ala Val Asn Thr
625 630 635 640

Thr Val Asp Gly Tyr Gly Val Asn Ala Asn Gly Glu Trp Val Asn
645 650 655


<210> 3 
<211> 284 
<212> PRT 

<213> Streptococcus pneumoniae


<400> 3 


Glu Asn Glu Gly Ala Thr Gln Val Pro Thr Ser Ser Asn Arg Ala Asn
1 5 10 15

Glu Ser Gln Ala Glu Gln Gly Glu Gln Pro Lys Lys Leu Asp Ser Glu
20 25 30

Arg Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys Ile Val
35 40 45

Gly Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr Ile Thr Val
50 55 60

Ala Leu Val Asn Glu Leu Asn Asn Ile Lys Asn Glu Tyr Leu Asn Lys
65 70 75 80

Ile Val Glu Ser Thr Ser Glu Ser Gln Leu Gln Ile Leu Met Met Glu
85 90 95

Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp Ser
100 105 110

Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser Asp
115 120 125

Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala Glu
130 135 140

Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys
145 150 155 160

Glu Glu Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu
165 170 175

Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu
180 185 190

Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gln Lys Ile Lys
195 200 205

Gln Ala Glu Ala Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu
210 215 220

Lys Lys Ile Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lys Arg
225 230 235 240

Arg Ala Asp Ala Lys Glu Gln Gly Lys Pro Lys Gly Arg Ala Lys Arg
245 250 255

Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala
260 265 270

Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu
275 280


<210> 4 
<211> 106 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 4 


Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala
1 5 10 15

Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro
20 25 30

Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val
35 40 45

Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
50 55 60

Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu Ser
65 70 75 80

Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg Lys
85 90 95

Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala
100 105


<210> 5 
<211> 109 
<212> PRT

<213> Streptococcus pneumoniae 


<400> 5 

Thr Glu Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu
1 5 10 15

Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
20 25 30

Pro Thr Ile Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
35 40 45

Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Val Lys Ala Asn
50 55 60

Glu Pro Arg Asp Glu Gln Lys Ile Lys Gln Ala Glu Ala Glu Val Glu
65 70 75 80

Ser Lys Gln Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Thr Asp Arg
85 90 95

Glu Glu Ala Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala
100 105


<210> 6
<211> 4
<212> PRT

<213> Streptococcus pneumoniae
<400> 6 

Lys Xaa Xaa Glu 
1 


<210> 7 
<211> 376
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 7


Glu Asn Glu Gly Ser Thr Gln Ala Ala Thr Ser Ser Asn Met Ala Lys
1 5 10 15

Thr Glu His Arg Lys Ala Ala Lys Gln Val Val Asp Glu Tyr Ile Glu
20 25 30

Lys Met Leu Arg Glu Ile Gln Leu Asp Arg Arg Lys His Thr Gln Asn
35 40 45

Val Ala Leu Asn Ile Lys Leu Ser Ala Ile Lys Thr Lys Tyr Leu Arg
50 55 60

Glu Leu Asn Val Leu Glu Glu Lys Ser Lys Asp Glu Leu Pro Ser Glu
65 70 75 80

Ile Lys Ala Lys Leu Asp Ala Ala Phe Glu Lys Phe Lys Lys Asp Thr
85 90 95

Leu Lys Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu
100 105 110

Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
115 120 125

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Phe Asp
130 135 140

Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
145 150 155 160

Glu Ser Arg Asn Glu Gly Thr Ile Lys Gln Ala Lys Glu Lys Val Glu
165 170 175

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg
180 185 190

Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Asp Ala Lys Leu Lys
195 200 205

Glu Ala Asn Val Ala Thr Ser Asp Gln Gly Lys Pro Lys Gly Arg Ala
210 215 220

Lys Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn
225 230 235 240

Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser
245 250 255

Ser Ser Leu Lys Ser Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val
260 265 270

Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg
275 280 285

Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu
290 295 300

Ser Asp Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu
305 310 315 320

Ala Lys Glu Pro Arg Asp Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys
325 330 335

Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr
340 345 350

Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu
355 360 365

Asp Lys Val Lys Glu Lys Pro Ala
370 375


<210> 8
<211> 663 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 8 

Glu Asn Glu Gly Ser Thr Gln Ala Ala Thr Ser Ser Asn Met Ala Lys
1 5 10 15

Thr Glu His Arg Lys Ala Ala Lys Gln Val Val Asp Glu Tyr Ile Glu
20 25 30

Lys Met Leu Arg Glu Ile Gln Leu Asp Arg Arg Lys His Thr Gln Asn
35 40 45

Val Ala Leu Asn Ile Lys Leu Ser Ala Ile Lys Thr Lys Tyr Leu Arg
50 55 60

Glu Leu Asn Val Leu Glu Glu Lys Ser Lys Asp Glu Leu Pro Ser Glu
65 70 75 80

Ile Lys Ala Lys Leu Asp Ala Ala Phe Glu Lys Phe Lys Lys Asp Thr
85 90 95

Leu Lys Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu
100 105 110

Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
115 120 125

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Phe Asp
130 135 140

Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
145 150 155 160

Glu Ser Arg Asn Glu Gly Thr Ile Lys Gln Ala Lys Glu Lys Val Glu
165 170 175

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg
180 185 190

Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Asp Ala Lys Leu Lys
195 200 205

Glu Ala Asn Val Ala Thr Ser Asp Gln Gly Lys Pro Lys Gly Arg Ala
210 215 220

Lys Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn
225 230 235 240

Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser
245 250 255

Ser Ser Leu Lys Ser Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val
260 265 270

Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg
275 280 285

Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu
290 295 300

Ser Asp Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu
305 310 315 320

Ala Lys Glu Pro Arg Asp Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys
325 330 335

Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr
340 345 350

Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu
355 360 365

Asp Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala
370 375 380

Thr Gln Pro Glu Lys Pro Ala Pro Lys Pro Glu Lys Pro Ala Glu Gln
385 390 395 400

Pro Lys Ala Glu Lys Thr Asp Asp Gln Gln Ala Glu Glu Asp Tyr Ala
405 410 415

Arg Arg Ser Glu Glu Glu Tyr Asn Arg Leu Thr Gln Gln Gln Pro Pro
420 425 430

Lys Thr Glu Lys Pro Ala Gln Pro Ser Thr Pro Lys Thr Gly Trp Lys
435 440 445

Gln Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala
450 455 460

Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn
465 470 475 480

Gly Ala Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr
485 490 495

Leu Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gln Asn Asn Gly
500 505 510

Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly Trp Leu
515 520 525

Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala
530 535 540

Thr Gly Trp Leu Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn
545 550 555 560

Gly Asp Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr
565 570 575

Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Leu Gln Tyr Asn Gly
580 585 590

Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Val
595 600 605

Lys Asp Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Lys
610 615 620

Ala Ser Gln Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gly
625 630 635 640

Ser Gly Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Gly Val Asn
645 650 655

Ala Asn Gly Glu Trp Val Asn
660


<210> 9 
<211> 254 
<212> PRT 

<213> Streptococcus pneumoniae 
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< 400> 9 

Glu Asn Glu Gly Ser Thr Gln Ala Ala Thr Ser Ser Asn Met Ala Lys
1 5 10 15

Thr Glu His Arg Lys Ala Ala Lys Gln Val Val Asp Glu Tyr Ile Glu
20 25 30

Lys Met Leu Arg Glu Ile Gln Leu Asp Arg Arg Lys His Thr Gln Asn
35 40 45

Val Ala Leu Asn Ile Lys Leu Ser Ala Ile Lys Thr Lys Tyr Leu Arg
50 55 60

Glu Leu Asn Val Leu Glu Glu Lys Ser Lys Asp Glu Leu Pro Ser Glu
65 70 75 80

Ile Lys Ala Lys Leu Asp Ala Ala Phe Glu Lys Phe Lys Lys Asp Thr
85 90 95

Leu Lys Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu
100 105 110

Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
115 120 125

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Phe Asp
130 135 140

Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
145 150 155 160

Glu Ser Arg Asn Glu Gly Thr Ile Lys Gln Ala Lys Glu Lys Val Glu
165 170 175

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg
180 185 190

Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Asp Ala Lys Leu Lys
195 200 205

Glu Ala Asn Val Ala Thr Ser Asp Gln Gly Lys Pro Lys Gly Arg Ala
210 215 220

Lys Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn
225 230 235 240

Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu
245 250


<210> 10
<211> 106
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 10 

Lys Ser Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala
1 5 10 15

Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro
20 25 30

Thr Asn Thr Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu Ser Asp Val
35 40 45

Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
50 55 60

Pro Arg Asp Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys Val Glu Ser
65 70 75 80

Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg Lys
85 90 95

Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala
100 105


<210> 11
<211> 107 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 11 

Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu Ala Lys
1 5 10 15

Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr
20 25 30

Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Phe Asp Val Lys
35 40 45

Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Ser
50 55 60

Arg Asn Glu Gly Thr Ile Lys Gln Ala Lys Glu Lys Val Glu Ser Lys
65 70 75 80

Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg Lys Lys
85 90 95

Ala Glu Glu Glu Ala Lys Arg Lys Ala Asp Ala
100 105


<210> 12
<211> 1219 
<212> DNA 

<213> Streptococcus pneumoniae 


<400> 12 

gagaacgagg gagctaccca agtacccact tcttctaata gggcaaatga aagtcaggca 60
gaacaaggag aacaacctaa aaaactcgat tcagaacgag ataaggcaag gaaagaggtc 120
gaggaatatg taaaaaaaat agtgggtgag agctatgcaa aatcaactaa aaagcgacat 180
acaattactg tagctctagt taacgagttg aacaacatta agaacgagta tttgaataaa 240
atagttgaat caacctcaga aagccaacta cagatactga tgatggagag tcgatcaaaa 300
gtagatgaag ctgtgtctaa gtttgaaaag gactcatctt cttcgtcaag ttcagactct 360
tccactaaac cggaagcttc agatacagcg aagccaaaca agccgacaga accaggagaa 420
aaggtagcag aagccaagaa gaaggttgaa gaagctgaga aaaaagccaa ggatcaaaaa 480
gaagaagatc gtcgtaacta cccaaccatt acttacaaaa cgcttgaact tgaaattgct 540
gagtccgatg tggaagttaa aaaagcggag cttgaactag taaaagtgaa agctaacgaa 600
cctcgagacg agcaaaaaat taagcaagca gaagcggaag ttgagagtaa acaagctgag 660
gctacaaggt taaaaaaaat caagacagat cgtgaagaag cagaagaaga agctaaacga 720
agagcagatg ctaaagagca aggtaaacca aaggggcggg caaaacgagg agttcctgga 780
gagctagcaa cacctgataa aaaagaaaat gatgcgaagt cttcagattc tagcgtaggt 840
gaagaaactc ttccaagccc atccctgaaa ccagaaaaaa aggtagcaga agctgagaag 900
aaggttgaag aagctaagaa aaaagccgag gatcaaaaag aagaagatcg ccgtaactac 960
ccaaccaata cttacaaaac gcttgaactt gaaattgctg agtccgatgt ggaagttaaa 1020
aaagcggagc ttgaactagt aaaagaggaa gctaaggaac ctcgaaacga ggaaaaagtt 1080
aagcaagcaa aagcggaagt tgagagtaaa aaagctgagg ctacaaggtt agaaaaaatc 1140
aagacagatc gtaaaaaagc agaagaagaa gctaaacgaa aagcagcaga agaagataaa 1200
gttaaagaaa aaccagctg 1219


<210> 13 
<211> 1969 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 13 

gagaacgagg gagctaccca agtacccact tcttctaata gggcaaatga aagtcaggca 60
gaacaaggag aacaacctaa aaaactcgat tcagaacgag ataaggcaag gaaagaggtc 120
gaggaatatg taaaaaaaat agtgggtgag agctatgcaa aatcaactaa aaagcgacat 180
acaattactg tagctctagt taacgagttg aacaacatta agaacgagta tttgaataaa 240
atagttgaat caacctcaga aagccaacta cagatactga tgatggagag tcgatcaaaa 300
gtagatgaag ctgtgtctaa gtttgaaaag gactcatctt cttcgtcaag ttcagactct 360
tccactaaac cggaagcttc agatacagcg aagccaaaca agccgacaga accaggagaa 420
aaggtagcag aagctaagaa gaaggttgaa gaagctgaga aaaaagccaa ggatcaaaaa 480
gaagaagatc gtcgtaacta cccaaccatt acttacaaaa cgcttgaact tgaaattgct 540
gagtccgatg tggaagttaa aaaagcggag cttgaactag taaaagtgaa agctaacgaa 600
cctcgagacg agcaaaaaat taagcaagca gaagcggaag ttgagagtaa acaagctgag 660
gctacaaggt taaaaaaaat caagacagat cgtgaagaag cagaagaaga agctaaacga 720
agagcagatg ctaaagagca aggtaaacca aaggggcggg caaaacgagg agttcctgga 780
gagctagcaa cacctgataa aaaagaaaat gatgcgaagt cttcagattc tagcgtaggt 840
gaagaaactc ttccaagccc atccctgaaa ccagaaaaaa aggtagcaga agctgagaag 900
aaggttgaag aagctaagaa aaaagccgag gatcaaaaag aagaagatcg ccgtaactac 960
ccaaccaata cttacaaaac gcttgaactt gaaattgctg agtccgatgt ggaagttaaa 1020
aaagcggagg cttgaactag taaaagagga agctaaggaa cctcgaaacg aggaaaaagt 1080
taagcaagca aaagcggaag ttgagagtaa aaaagctgag gctacaaggt tagaaaaaat 1140
caagacagat cgtaaaaaag cagaagaaga agctaaacga aaagcagcag aagaagataa 1200
agttaaagaa aaaccagctg aacaaccaca accagcgccg gctccaaaag cagaaaaacc 1260
agctccagct ccaaaaccag agaatccagc tgaacaacca aaagcagaaa aaccagctga 1320
tcaacaagct gaagaagact atgctcgtag atcagaagaa gaatataatc gcttgactca 1380
acagcaaccg ccaaaaactg aaaaaccagc acaaccatct actccaaaaa caggctggaa 1440
acaagaaaac ggtatgtggt acttctacaa tactgatggt tcaatggcga caggatggct 1500
ccaaaacaat ggctcatggt actacctcaa cagcaatggc gctatggcga caggatggct 1560
ccaaaacaat ggttcatggt actatctaaa cgctaatggt tcaatggcaa caggatggct 1620
ccaaaacaat ggttcatggt actacctaaa cgctaatggt tcaatggcga caggatggct 1680
ccaatacaat ggctcatggt actacctaaa cgctaatggt tcaatggcga caggatggct 1740
ccaatacaat ggctcatggt actacctaaa cgctaatggt gatatggcga caggttgggt 1800
gaaagatgga gatacctggt actatcttga agcatcaggt gctatgaaag caagccaatg 1860
gttcaaagta tcagataaat ggtactatgt caatggctca ggtgcccttg cagtcaacac 1920
aactgtagat ggctatggag tcaatgccaa tggtgaatgg gtaaactaa 1969
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<2 10> 14 
<211> 853
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 14 

gagaacgagg gagctaccca agtacccact tcttctaata gggcaaatga aagtcaggca 60 

gaacaaggag aacaacctaa aaaactcgat tcagaacgag ataaggcaag gaaagaggtc 120

gaggaatatg taaaaaaaat agtgggtgag agctatgcaa aatcaactaa aaagcgacat 180 

acaattactg tagctctagt taacgagttg aacaacatta agaacgagta tttgaataaa 240 

atagttgaat caacctcaga aagccaacta cagatactga tgatggagag tcgatcaaaa 300 

gtagatgaag ctgtgtctaa gtttgaaaag gactcatctt cttcgtcaag ttcagactct 360 

tccactaaac cggaagcttc agatacagcg aagccaaaca agccgacaga accaggagaa 420 

aaggtagcag aagctaagaa gaaggttgaa gaagctgaga aaaaagccaa ggatcaaaaa 480 

gaagaagatc gtcgtaacta cccaaccatt acttacaaaa cgcttgaact tgaaattgct 540 

gagtccgatg tggaagttaa aaaagcggag cttgaactag taaaagtgaa agctaacgaa 600 

cctcgagacg agcaaaaaat taagcaagca gaagcggaag ttgagagtaa acaagctgag 660 

gctacaaggt taaaaaaaat caagacagat cgtgaagaag cagaagaaga agctaaacga 720 

agagcagatg ctaaagagca aggtaaacca aaggggcggg caaaacgagg agttcctgga 780 

gagctagcaa cacctgataa aaaagaaaat gatgcgaagt cttcagattc tagcgtaggt 840 

gaagaaactc ttc 853

<210> 15 
<211> 318 
<212> DNA 

<213> Streptococcus pneumoniae

<400> 15 

aaaccagaaa aaaaggtagc agaagctgag aagaaggttg aagaagctaa gaaaaaagcc 60
gaggatcaaa aagaagaaga tcgccgtaac tacccaacca atacttacaa aacgcttgaa 120
cttgaaattg ctgagtccga tgtggaagtt aaaaaagcgg agcttgaact agtaaaagag 180
gaagctaagg aacctcgaaa cgaggaaaaa gttaagcaag caaaagcgga agttgagagt 240
aaaaaagctg aggctacaag gttagaaaaa atcaagacag atcgtaaaaa agcagaagaa 300
gaagctaaac gaaaagca 318


<210> 16 
<211> 327 
<212> DNA 

<213> Streptococcus pneumoniae 


<400> 16 


acagaaccag gagaaaaggt agcagaagct aagaagaagg ttgaagaagc tgagaaaaaa 60
gccaaggatc aaaaagaaga agatcgtcgt aactacccaa ccattactta caaaacgctt 120
gaacttgaaa ttgctgagtc cgatgtggaa gttaaaaaag cggagcttga actagtaaaa 180
gtgaaagcta acgaacctcg agacgagcaa aaaattaagc aagcagaagc ggaagttgag 240
agtaaacaag ctgaggctac aaggttaaaa aaaatcaaga cagatcgtga agaagcagaa 300
gaagaagcta aacgaagagc agatgct 327


<210> 17
<211> 1129 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 17 

gaaaacgaag gaagtaccca agcagccact tcttctaata tggcaaagac agaacatagg 60 
aaagctgcta aacaagtcgt cgatgaatat atagaaaaaa tgttgaggga gattcaacta 120 
gatagaagaa aacataccca aaatgtcgcc ttaaacataa agttgagcgc aattaaaacg 180 
aagtatttgc gtgaattaaa tgttttagaa gagaagtcga aagatgagtt gccgtcagaa 240 
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ataaaagcaa agttagacgc agcttttgag aagcttaaaa aagatacatt gaaaccagga 300 
gaaaaggtag cagaagctaa gaagaaggtt gaagaagcta agaaaaaagc cgaggatcaa 360 
aaagaagaag atcgtcgtaa ctacccaacc aatacttaca aaacgcttga acttgaaatt 420
gctgagttcg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaaa 480 
gaatctcgaa acgagggcac aattaagcaa gcaaaagaga aagttgagag taaaaaagct 540 
gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 600 
cgaaaagcag atgctaagtt gaaggaagct aatgtagcga cttcagatca aggtaaacca 660 
aaggggcggg caaaacgagg agttcctgga gagctagcaa cacctgataa aaaagaaaat 720 
gatgcgaagt cttcagattc tagcgtaggt gaagaaactc ttccaagctc atccctgaaa 780 
tcaggaaaaa aggtagcaga agctgagaag aaggttgaag aagctgagaa aaaagccaag 840 
gatcaaaaag aagaagatcg ccgtaactac ccaaccaaca cttacaaaac gcttgacctt 900 
gaaattgctg agtccgatgt gaaagttaaa gaagcggagc ttgaactagt aaaagaggaa 960 
gctaaggaac ctcgagacga ggaaaaaatt aagcaagcaa aagcgaaagt tgagagtaaa 1020 
aaagctgagg ctacaaggtt agaaaacatc aagacagatc gtaaaaaagc agaagaagaa 1080 
gctaaacgaa aagcagcaga agaagataaa gttaaagaaa aaccagctg 1129 

<210> 18 
<211> 1992 
<212> DNA 

<213> Streptococcus pneumoniae 


<400> 18 

gaaaacgaag gaagtaccca agcagccact tcttctaata tggcaaagac agaacatagg 60
aaagctgcta aacaagtcgt cgatgaatat atagaaaaaa tgttgaggga gattcaacta 120
gatagaagaa aacataccca aaatgtcgcc ttaaacataa agttgagcgc aattaaaacg 180
aagtatttgc gtgaattaaa tgttttagaa gagaagtcga aagatgagtt gccgtcagaa 240
ataaaagcaa agttagacgc agcttttgag aagtttaaaa aagatacatt gaaaccagga 300
gaaaaggtag cagaagctaa gaagaaggtt gaagaagcta agaaaaaagc cgaggatcaa 360
aaagaagaag atcgtcgtaa ctacccaacc aatacttaca aaacgcttga acttgaaatt 420
gctgagttcg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaaa 480
gaatctcgaa acgagggcac aattaagcaa gcaaaagaga aagttgagag taaaaaagct 540
gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 600
cgaaaagcag atgccaagtt gaaggaagct aatgtagcga cttcagatca aggtaaacca 660
aaggggcggg caaaacgagg agttcctgga gagctagcaa cacctgataa aaaagaaaat 720
gatgcgaagt cttcagattc tagcgtaggt gaagaaactc ttccaagctc atccctgaaa 780
tcaggaaaaa aggtagcaga agctgagaag aaggttgaag aagctgagaa aaaagccaag 840
gatcaaaaag aagaagatcg ccgtaactac ccaaccaata cttacaaaac gcttgacctt 900
gaaattgctg agtccgatgt gaaagttaaa gaagcggagc ttgaactagt aaaagaggaa 960
gctaaggaac ctcgagacga ggaaaaaatt aagcaagcaa aagcgaaagt tgagagtaaa 1020
aaagctgagg ctacaaggtt agaaaacatc aagacagatc gtaaaaaagc agaagaagaa 1080
gctaaacgaa aagcagcaga agaagataaa gttaaagaaa aaccagctga acaaccacaa 1140
ccagcgccgg ctactcaacc agaaaaacca gctccaaaac cagagaagcc agctgaacaa 1200
ccaaaagcag aaaaaacaga tgatcaacaa gctgaagaag actatgctcg tagatcagaa 1260
gaagaatata atcgcttgac tcaacagcaa ccgccaaaaa ctgaaaaacc agcacaacca 1320
tctactccaa aaacaggctg gaaacaagaa aacggtatgt ggtacttcta caatactgat 1380
ggttcaatgg caacaggatg gctccaaaac aacggttcat ggtactatct aaacgctaat 1440
ggtgctatgg cgacaggatg gctccaaaac aatggttcat ggtactatct aaacgctaat 1500
ggttcaatgg caacaggatg gctccaaaac aatggttcat ggtactacct aaacgctaat 1560
ggtgctatgg cgacaggatg gctccaatac aatggttcat ggtactacct aaacagcaat 1620
ggcgctatgg cgacaggatg gctccaatac aatggctcat ggtactacct caacgctaat 1680
ggtgatatgg cgacaggatg gctccaaaac aacggttcat ggtactacct caacgctaat 1740
ggtgatatgg cgacaggatg gctccaatac aacggttcat ggtattacct caacgctaat 1800
ggtgatatgg cgacaggttg ggtgaaagat ggagatacct ggtactatct tgaagcatca 1860
ggtgctatga aagcaagcca atggttcaaa gtatcagata aatggtacta tgtcaatggc 1920
tcaggtgccc ttgcagtcaa cacaactgta gatggctatg gagtcaatgc caatggtgaa 1980
tgggtaaact aa 1992


<210> 19
<211> 763
<212> DNA

<213> Streptococcus pneumoniae


<400> 19 

gaaaacgaag gaagtaccca agcagccact tcttctaata tggcaaagac agaacatagg 60
aaagctgcta aacaagtcgt cgatgaatat atagaaaaaa tgttgaggga gattcaacta 120
gatagaagaa aacataccca aaatgtcgcc ttaaacataa agttgagcgc aattaaaacg 180
aagtatttgc gtgaattaaa tgttttagaa gagaagtcga aagatgagtt gccgtcagaa 240
ataaaagcaa agttagacgc agcttttgag aagtttaaaa aagatacatt gaaaccagga 300
gaaaaggtag cagaagctaa gaagaaggtt gaagaagcta agaaaaaagc cgaggatcaa 360
aaagaagaag atcgtcgtaa ctacccaacc aatacttaca aaacgcttga acttgaaatt 420
gctgagttcg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaaa 480
gaatctcgaa acgagggcac aattaagcaa gcaaaagaga aagttgagag taaaaaagct 540
gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 600
cgaaaagcag atgctaagtt gaaggaagct aatgtagcga cttcagatca aggtaaacca 660
aaggggcggg caaaacgagg agttcctgga gagctagcaa cacctgataa aaaagaaaat 720
gatgcgaagt cttcagattc tagcgtaggt gaagaaactc ttc 763


<210> 20 
<211> 318 
<212> DNA 

<213> Streptococcus pneumoniae 


<400> 20 

aaatcaggaa aaaaggtagc agaagctgag aagaaggttg aagaagctga gaaaaaagcc 60
aaggatcaaa aagaagaaga tcgccgtaac tacccaacca atacttacaa aacgcttgac 120
cttgaaattg ctgagtccga tgtgaaagtt aaagaagcgg agcttgaact agtaaaagag 180
gaagctaagg aacctcgaga cgaggaaaaa attaagcaag caaaagcgaa agttgagagt 240
aaaaaagctg aggctacaag gttagaaaac atcaagacag atcgtaaaaa agcagaagaa 300
gaagctaaac gaaaagca 318

<210> 21
<211> 321 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 21 

ccaggagaaa aggtagcaga agctaagaag aaggttgaag aagctaagaa aaaagccgag 60 
gatcaaaaag aagaagatcg tcgtaactac ccaaccaata cttacaaaac gcttgaactt 120 

gaaattgctg agttcgatgt gaaagttaaa gaagcggagc ttgaactagt aaaagaggaa 180 

gctaaagaat ctcgaaacga gggcacaatt aagcaagcaa aagagaaagt tgagagtaaa 240

aaagctgagg ctacaaggtt agaaaacatc aagacagatc gtaaaaaagc agaagaagaa 300 

gctaaacgaa aagcagatgc t 321

<210> 22 
<211> 121 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 22 

Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys
1 5 10 15

Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg
20 25 30

Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala
35 40 45

Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu
50 55 60

Glu Ala Lys Glu Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala
65 70 75 80

Glu Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys
85 90 95

Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu
100 105 110

Glu Asp Lys Val Lys Glu Lys Pro Ala
115 120


<210> 23 
<211> 122 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 23 


Pro Ser Ser Ser Leu Lys Ser Gly Lys Lys Val Ala Glu Ala Glu Lys
1 5 10 15

Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp
20 25 30

Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Asp Leu Glu Ile
35 40 45

Ala Glu Ser Asp Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys
50 55 60

Glu Glu Ala Lys Glu Pro Arg Asp Glu Glu Lys Ile Lys Gln Ala Lys
65 70 75 80

Ala Lys Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile
85 90 95

Lys Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala
100 105 110

Glu Glu Asp Lys Val Lys Glu Lys Arg Ala
115 120


<210> 24 
<211> 428 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 24 

Glu Asn Glu Gly Ala Thr Gln Val Pro Thr Ser Ser Asn Arg Ala Asn
1 5 10 15

Glu Ser Gln Ala Glu Gln Gly Glu Gln Pro Lys Lys Leu Asp Ser Glu
20 25 30

Arg Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys Ile Val
35 40 45

Gly Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr Ile Thr Val
50 55 60

Ala Leu Val Asn Glu Leu Asn Asn Ile Lys Asn Glu Tyr Leu Asn Lys
65 70 75 80

Ile Val Glu Ser Thr Ser Glu Ser Gln Leu Gln Ile Leu Met Met Glu
85 90 95

Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp Ser
100 105 110

Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser Asp
115 120 125

Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala Glu
130 135 140

Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys
145 150 155 160

Glu Glu Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu
165 170 175

Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu
180 185 190

Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gln Lys Ile Lys
195 200 205

Gln Ala Glu Ala Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu
210 215 220

Lys Lys Ile Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lys Arg
225 230 235 240

Arg Ala Asp Ala Lys Glu Gln Gly Lys Pro Lys Gly Arg Ala Lys Arg
245 250 255

Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala
260 265 270

Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser
275 280 285

Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu
290 295 300

Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
305 310 315 320

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
325 330 335

Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
340 345 350

Glu Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu
355 360 365

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg
370 375 380

Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys
385 390 395 400

Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro Lys
405 410 415

Ala Glu Lys Pro Ala Pro Ala Pro Lys Pro Glu Asn
420 425


<210> 25 
<211> 23 
<212> DNA 

<213> Streptococcus pneumoniae
<400> 25

ggcggatcca tggaraayga rgg 23

<210> 26
<211> 33
<212> DNA

<213> Streptococcus pneumoniae
<400> 26 

gccgtcgact tagtttaccc attcaccatt ggc 33 

<210> 27 
<211> 5 
<212> PRT 

<213> Streptococcus pneumoniae
<400> 27 

Xaa Glu Asn Glu Gly 
1 5 

<210> 28 
<211> 439 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 28 

Ala Val Ala Ser Leu Phe Met Gly Ser Val Val His Ala Thr Glu Lys
1 5 10 15

Glu Val Thr Thr Gln Val Ala Thr Ser Ser Asn Lys Ala Asn Lys Ser
20 25 30

Gln Thr Glu His Met Lys Ala Ala Lys Gln Val Asp Glu Tyr Ile Lys
35 40 45


” L *“ G1 " L *“ ASP Ar3 **» Gin Asn Gly Leu 

50 


Leu Thr Lys Leu Gly Val Ile Lys Thr Glu Tyr Leu His Gly Leu Ser
65 70 75 80

Val Ser Lys Lys Lys Ser Glu Ala Glu Leu Pro Ser Glu Ile Lys Ala
85 90 95

Lys Leu Asp Ala Ala Phe Glu Gln Phe Lys Lys Asp Thr Leu Pro Thr
100 105 110

Glu Pro Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala
115 120 125

Lys Lys Lys Ala Glu Asp Gln Lys Glu Lys Asp Leu Arg Asn Tyr Pro
130 135 140

Thr Asn Thr Tyr Lys Thr Leu Glu Leu Asp Ile Ala Glu Ser Asp Val
145 150 155 160

Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
165 170 175

Ser Arg Asp Glu Lys Lys Ile Asn Gln Ala Lys Ala Lys Val Glu Asn
180 185 190

Lys Lys Ala Glu Ala Thr Arg Leu Lys Asn Ile Lys Thr Asp Arg Glu
195 200 205

Lys Ala Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Leu Gln Glu Ala
210 215 220

Asn Val Ala Thr Ser Glu Gln Asp Lys Ser Lys Arg Arg Ala Lys Arg
225 230 235 240

Glu Val Xaa Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala
245 250 255

Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Thr Ser Pro Ser
260 265 270

Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu
275 280 285

Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
290 295 300

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
305 310 315 320

Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
325 330 335

Glu Ser Arg Asn Glu Glu Lys Ile Lys Gln Val Lys Ala Lys Val Glu
340 345 350

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg
355 360 365

Lys Lys Ala Glu Glu Glu Glu Ala Lys Arg Arg Ala Ala Glu Glu Asp
370 375 380

Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro
385 390 395 400

Gln Pro Glu Lys Pro Thr Glu Glu Pro Glu Asn Pro Ala Pro Ala Pro
405 410 415

Ala Pro Lys Pro Glu Asn Pro Ala Glu Lys Pro Lys Ala Glu Lys Pro
420 425 430

Ala Asp Gln Gln Ala Glu Glu
435


<210> 29 
<211> 437 
<212> PRT 

<213> Streptococcus pneumoniae


<400> 29 

Ala Val Ala Ser Leu Phe Met Gly Ser Val Val His Ala Thr Glu Lys
1 5 10 15

Glu Val Thr Thr Gln Val Ala Thr Ser Ser Asn Arg Ala Asn Lys Ser
20 25 30

Gln Thr Glu His Met Lys Ala Ala Lys Gln Val Asp Glu Tyr Ile Lys
35 40 45

Lys Lys Leu Gln Leu Asp Arg Arg Lys His Thr Gln Asn Val Gly Leu
50 55 60

Leu Thr Lys Leu Gly Val Ile Lys Thr Glu Tyr Leu His Gly Leu Ser
65 70 75 80

Ser Lys Lys Lys Ser Glu Ala Glu Leu Pro Ser Glu Ile Lys Ala
85 90 95

Lys Leu Asp Ala Ala Phe Glu Gln Phe Lys Lys Asp Thr Leu Pro Thr
100 105 110

Glu Pro Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala
115 120 125

Lys Lys Lys Ala Glu Asp Gln Lys Glu Lys Asp Leu Arg Asn Tyr Pro
130 135 140

Thr Asn Thr Tyr Lys Thr Leu Glu Leu Asp Ile Ala Glu Ser Asp Val
145 150 155 160

Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
165 170 175


Ser Arg Asp Glu Lys Lys Ile Asn Gln Ala Lys Ala Lys Val Glu Asn
180 185 190


Lys Lys Ala Glu Ala Thr Arg Leu Lys Asn Ile Lys Thr Asp Arg Glu
195 200 205

Lys Ala Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Leu Gln Glu Ala
210 215 220

Asn Val Ala Thr Ser Glu Gln Asp Lys Ser Lys Arg Arg Ala Lys Arg
225 230 235 240

Glu Val Leu Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala
245 250 255

Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Thr Ser Pro Ser
260 265 270

Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu
275 280 285

Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
290 295 300

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
305 310 315 320

Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
325 330 335

Glu Ser Arg Asn Glu Glu Lys Ile Lys Gln Val Lys Ala Lys Val Glu
340 345 350

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg
355 360 365

Lys Lys Ala Glu Glu Glu Glu Ala Lys Arg Arg Ala Ala Glu Glu Asp
370 375 380

Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro
385 390 395 400

Gln Pro Glu Lys Pro Thr Glu Glu Pro Glu Asn Pro Ala Pro Ala Pro
405 410 415

Ala Pro Lys Pro Glu Asn Pro Ala Glu Lys Pro Lys Ala Glu Lys Pro
420 425 430

Ala Asp Gln Gln Ala
435


<210> 30 
<211> 439 
<212> PRT

<213> Streptococcus pneumoniae


<400> 30 

Val Ala Val Ala Ser Leu Val Met Gly Ser Val Val His Ala Thr Glu
1 5 10 15

Lys Glu Val Thr Thr Gln Val Ala Thr Ser Ser Asn Arg Ala Asn Glu
20 25 30

Ser Gln Ala Gly His Arg Lys Ala Ala Glu Gln Phe Asp Glu Tyr Ile
35 40 45

Lys Thr Met Ile Gln Leu Asp Arg Arg Lys His Thr Gln Asn Phe Ala
50 55 60

Leu Asn Ile Lys Leu Ser Arg Ile Lys Thr Glu Tyr Leu Arg Lys Leu
65 70 75 80

Asn Val Leu Glu Glu Lys Ser Lys Ala Glu Leu Pro Ser Glu Thr Lys
85 90 95

Lys Glu Ile Asp Ala Ala Phe Glu Gln Phe Lys Lys Asp Thr Asn Arg
100 105 110

Thr Lys Lys Thr Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Lys
115 120 125

Lys Lys Ala Lys Ala Gln Lys Glu Glu Asp His Arg Asn Tyr Pro Thr
130 135 140

Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val Glu
145 150 155 160

Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Ser
165 170 175

Arg Asp Asp Glu Lys Ile Lys Gln Ala Glu Ala Lys Val Glu Ser Lys
180 185 190

Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg Glu Lys
195 200 205

Ala Glu Glu Glu Ala Lys Arg Arg Ala Glu Ala Lys Leu Lys Glu Ala
210 215 220

Val Glu Lys Asn Val Ala Thr Ser Glu Gln Asp Lys Pro Lys Gly Arg
225 230 235 240

Arg Lys Arg Gly Val Pro Gly Glu Gln Ala Thr Pro Asp Lys Lys Glu
245 250 255

Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Ala Leu Pro
260 265 270

Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys
275 280 285

Val Ala Glu Ala Glu Lys Lys Ala Lys Ala Gln Lys Glu Glu Asp Arg
290 295 300

Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala
305 310 315 320

Glu Ser Asp Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu
325 330 335

Glu Ala Lys Glu Ser Arg Asn Glu Glu Lys Val Asn Gln Ala Lys Ala
340 345 350

Lys Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys
355 360 365

Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu
370 375 380

Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro
385 390 395 400

Ala Pro Gln Pro Glu Lys Pro Thr Glu Glu Pro Glu Asn Pro Ala Pro
405 410 415

Ala Pro Lys Pro Glu Lys Pro Ala Glu Gln Pro Lys Ala Glu Lys Thr
420 425 430

Asp Asp Gln Gln Ala Glu Glu
435

<210> 31
<211> 419 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 31 

Ala Val Ala Ser Leu Val Met Gly Ser Val Val His Ala Thr Glu Asn
1 5 10 15

Glu Gly Thr Thr Gln Ala Pro Thr Ser Ser Asn Arg Gly Asn Glu Ser
20 25 30

Gln Ala Glu His Met Lys Ala Ala Lys Gln Val Asp Glu Tyr Ile Glu
35 40 45

Lys Met Leu Gln Leu Asp Arg Arg Lys His Thr Gln Asn Val Gly Leu
50 55 60

Leu Thr Lys Leu Gly Ala Ile Lys Thr Glu Tyr Leu Arg Gly Leu Ser
65 70 75 80

Val Ser Lys Glu Lys Ser Thr Ala Glu Leu Pro Ser Glu Ile Lys Glu
85 90 95

Lys Leu Thr Ala Ala Phe Lys Gln Phe Lys Lys Asp Thr Leu Lys Pro
100 105 110

Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Ala Glu Ala Lys Lys
115 120 125

Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Ile
130 135 140

Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val Glu Val
145 150 155 160

Lys Lys Ala Glu Leu Glu Leu Val Lys Val Lys Ala Asn Glu Pro Arg
165 170 175

Asp Glu Glu Lys Ile Lys Gln Ala Glu Ala Val Glu Ser Lys Lys
180 185 190

Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Thr Asp Arg Glu Lys Ala
195 200 205

Glu Glu Glu Ala Lys Arg Arg Val Asp Ala Lys Glu Gln Asp Glu Ser
210 215 220

Ser Lys Arg Arg Lys Ser Arg Val Lys Arg Gly Asp Val Gly Glu Gln
225 230 235 240

Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp Ser Ser
245 250 255

Val Gly Glu Glu Thr Leu Pro Ser Pro Ser Leu Lys Pro Gly Lys Lys
260 265 270

Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Asp Lys Lys Ala Lys
275 280 285

Ala Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys
290 295 300

Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala
305 310 315 320

Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Pro Arg Asn Glu Glu
325 330 335

Lys Val Lys Gln Ala Lys Ala Glu Val Glu Ser Lys Lys Ala Glu Ala
340 345 350

Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg Lys Lys Ala Glu Glu Glu
355 360 365

Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys Glu Lys Pro Ala
370 375 380

Glu Gln Pro Lys Pro Ala Pro Ala Pro Gln Pro Glu Lys Pro Ala Pro
385 390 395 400

Lys Pro Glu Asn Pro Ala Glu Gln Pro Lys Ala Glu Lys Pro Ala Asp
405 410 415

Gln Gln Ala

<210> 32
<211> 437 
<212> PRT 

<213> Streptococcus pneumoniae


<400> 32 

Val Ala Ser Leu Phe Met Gly Ser Val Val His Ala Thr Glu Lys Glu
1 5 10 15

Val Thr Thr Gln Val Ala Thr Ser Ser Asn Lys Ala Asn Lys Ser Gln
20 25 30

Thr Glu His Met Lys Ala Ala Lys Gln Val Asp Glu Tyr Ile Lys Lys
35 40 45

Lys Leu Gln Leu Asp Arg Arg Lys His Thr Gln Asn Val Gly Leu Leu
50 55 60

Thr Lys Leu Gly Val Ile Lys Thr Glu Tyr Leu His Gly Leu Ser Val
65 70 75 80

Ser Lys Lys Lys Ser Glu Ala Glu Leu Pro Ser Glu Ile Lys Ala Lys
85 90 95

Leu Asp Ala Ala Phe Glu Gln Phe Lys Lys Asp Thr Leu Pro Thr Glu
100 105 110

Pro Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Lys
115 120 125

Lys Lys Ala Glu Asp Gln Lys Glu Lys Asp Leu Arg Asn Tyr Pro Thr
130 135 140

Asn Thr Tyr Lys Thr Leu Glu Leu Asp Ile Ala Glu Ser Asp Val Glu
145 150 155 160

Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Ser
165 170 175

Arg Asp Glu Lys Lys Ile Asn Gln Ala Lys Ala Lys Val Glu Asn Lys
180 185 190

Lys Ala Glu Ala Thr Arg Leu Lys Asn Ile Lys Thr Asp Arg Glu Lys
195 200 205

Ala Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Leu Gln Glu Ala Asn
210 215 220

Val Ala Thr Ser Glu Gln Asp Lys Ser Lys Arg Arg Ala Lys Arg Glu
225 230 235 240

Val Phe Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala Lys
245 250 255

Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Thr Ser Pro Ser Leu
260 265 270

Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala
275 280 285

Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro
290 295 300

Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val
305 310 315 320

Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
325 330 335

Ser Arg Asn Glu Glu Lys Ile Lys Gln Val Lys Ala Lys Val Glu Ser
340 345 350

Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg Lys
355 360 365

Lys Ala Glu Glu Glu Glu Ala Lys Arg Arg Ala Ala Glu Glu Asp Lys
370 375 380

Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro Gln
385 390 395 400

Pro Glu Lys Pro Thr Glu Glu Pro Glu Asn Pro Ala Pro Ala Pro Ala
405 410 415

Pro Lys Pro Glu Asn Pro Ala Glu Lys Pro Lys Ala Glu Lys Pro Ala
420 425 430

Asp Gln Gln Ala Glu
435


<210> 33 
<211> 433
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 33 

Cys Thr Val Ala Ser Leu Val Met Gly Ser Val Val His Ala Thr Glu
1 5 10 15

Asn Glu Arg Thr Thr Gln Val Pro Thr Ser Ser Asn Arg Gly Lys Pro
20 25 30

Glu Arg Arg Lys Ala Ala Glu Gln Phe Asp Glu Tyr Ile Asn Lys Met
35 40 45

Ile Gln Leu Asp Lys Arg Lys His Thr Gln Asn Leu Ala Phe Asn Ile
50 55 60

Gln Leu Ser Arg Ile Lys Thr Glu Tyr Leu Asn Gly Leu Lys Glu Lys
65 70 75 80

Ser Glu Ala Glu Leu Pro Ser Lys Ile Lys Ala Glu Leu Asp Ala Ala
85 90 95

Phe Lys Gln Phe Lys Lys Asp Thr Leu Pro Thr Glu Pro Glu Lys Lys
100 105 110

Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Glu Lys Lys Val Ala
115 120 125

Glu Ala Lys Lys Lys Ala Lys Ala Gln Lys Glu Glu Asp His Arg Asn
130 135 140

Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu Phe
145 150 155 160

Asp Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Lys Glu Ala
165 170 175

Asp Glu Ser Arg Asn Glu Gly Thr Ile Asn Gln Ala Lys Ala Lys Val
180 185 190

Glu Ser Glu Lys Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Thr Asp
195 200 205

Arg Glu Lys Ala Glu Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys
210 215 220

Glu Gln Asp Glu Ser Lys Arg Arg Lys Ser Arg Gly Lys Arg Gly Ala
225 230 235 240

Leu Gly Glu Gln Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala Lys Ser
245 250 255

Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser Leu Lys
260 265 270

Pro Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Asp
275 280 285

Lys Lys Ala Lys Ala Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr
290 295 300

Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val Lys
305 310 315 320

Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Ser
325 330 335

Arg Asn Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys Val Glu Ser Lys
340 345 350

Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg Lys Lys
355 360 365

Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys
370 375 380

Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro Gln Pro Glu
385 390 395 400

Lys Pro Ala Glu Glu Pro Glu Asn Pro Val Pro Ala Pro Lys Pro Glu
405 410 415

Asn Pro Ala Glu Gln Pro Lys Ala Glu Lys Pro Ala Asp Gln Gln Ala
420 425 430

Glu


<210> 34 
<211> 427 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 34 


Val Ala Val Ala Ser Leu Val Met Gly Ser Val Val His Ala Thr Glu
1 5 10 15

Lys Glu Val Thr Thr Gln Val Pro Thr Tyr Ser Asn Met Ala Lys Thr
20 25 30

Glu His Arg Lys Ala Ala Lys Gln Val Val Asp Glu Tyr Ile Glu Lys
35 40 45

Met Leu Arg Glu Ile Gln Leu Asp Arg Arg Lys His Thr Gln Asn Phe
50 55 60

Ala Phe Asn Met Lys Leu Ser Ala Ile Lys Thr Glu Tyr Leu Tyr Gly
65 70 75 80

Leu Lys Glu Lys Ser Glu Ala Glu Leu Pro Ser Glu Val Lys Ala Lys
85 90 95

Leu Asp Ala Ala Phe Glu Gln Phe Lys Lys Asp Thr Leu Lys Leu Gly
100 105 110

Glu Lys Val Ala Glu Ala Glu Lys Lys Val Ala Glu Ala Glu Lys Lys
115 120 125

Ala Lys Ala Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr
130 135 140

Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys
145 150 155 160

Lys Ala Glu Leu Glu Leu Leu Lys Glu Glu Ala Lys Thr Arg Asn Glu
165 170 175

Asp Thr Ile Asn Gln Ala Lys Ala Lys Val Glu Ser Lys Lys Ala Glu
180 185 190

Ala Thr Lys Leu Glu Glu Ile Lys Thr Asp Arg Lys Lys Ala Glu Glu
195 200 205

Glu Ala Lys Arg Lys Ala Glu Ala Glu Glu Asp Lys Val Lys Asp Lys
210 215 220

Leu Lys Arg Arg Thr Lys Arg Ala Val Pro Gly Glu Pro Ala Thr Pro
225 230 235 240

Asp Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu
245 250 255

Glu Thr Leu Pro Ser Pro Ser Leu Lys Ser Gly Lys Lys Val Ala Glu
260 265 270

Ala Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys
275 280 285

Glu Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Asp
290 295 300

Leu Glu Ile Ala Glu Ser Asp Val Lys Val Lys Glu Ala Glu Leu Glu
305 310 315 320

Leu Val Lys Glu Glu Ala Lys Gly Ser Arg Asn Glu Glu Lys Ile Asn
325 330 335

Gln Ala Lys Ala Glu Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu
340 345 350

Glu Lys Ile Lys Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg
355 360 365

Lys Ala Ala Glu Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gln Pro
370 375 380

Gln Pro Ala Pro Ala Pro Gln Pro Glu Lys Pro Thr Glu Glu Pro Glu
385 390 395 400

Asn Pro Ala Pro Ala Pro Lys Pro Glu Lys Pro Ala Glu Gln Pro Lys
405 410 415

Ala Glu Lys Thr Asp Asp Gln Gln Ala Glu Glu
420 425


<210> 35 
<211> 413 
<212> PRT 

<213> Streptococcus pneumoniae


<400> 35 

Glu Asn Glu Gly Ser Thr Gln Ala Ala Thr Ser Ser Asn Met Ala Lys
1 5 10 15

Thr Glu His Arg Lys Ala Ala Lys Gln Val Val Asp Glu Tyr Ile Glu
20 25 30

Lys Met Leu Arg Glu Ile Gln Leu Asp Arg Arg Lys His Thr Gln Asn
35 40 45

Val Ala Leu Asn Ile Lys Leu Ser Ala Ile Lys Thr Lys Tyr Leu Arg
50 55 60

Glu Leu Asn Val Leu Glu Glu Lys Ser Lys Asp Glu Leu Pro Ser Glu
65 70 75 80

Ile Lys Ala Lys Leu Asp Ala Ala Phe Glu Lys Phe Lys Lys Asp Thr
85 90 95

Leu Lys Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu
100 105 110

Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
115 120 125

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Phe Asp
130 135 140

Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
145 150 155 160

Glu Ser Arg Asn Glu Gly Thr Ile Lys Gln Ala Lys Glu Lys Val Glu
165 170 175

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg
180 185 190

Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Asp Ala Lys Leu Lys
195 200 205

Glu Ala Asn Val Ala Thr Ser Asp Gln Gly Lys Pro Lys Gly Arg Ala
210 215 220

Lys Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn
225 230 235 240

Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser
245 250 255

Ser Ser Leu Lys Ser Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val
260 265 270

Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg
275 280 285

Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu
290 295 300

Ser Asp Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu
305 310 315 320

Ala Lys Glu Pro Arg Asp Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys
325 330 335

Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr
340 345 350

Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu
355 360 365

Asp Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala
370 375 380

Thr Gln Pro Glu Lys Pro Ala Pro Lys Pro Glu Lys Pro Ala Glu Gln
385 390 395 400

Pro Lys Ala Glu Lys Thr Asp Asp Gln Gln Ala Glu Glu
405 410


<210> 36 
<211> 425 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 36 

Tyr Ile Ala Ser Leu Phe Leu Gly Gly Val Val His Ala Glu Gly Val
1 5 10 15

Arg Ser Glu Asn Asn Pro Thr Val Thr Ser Ser Gly Gln Asp Ile Ser
20 25 30

Lys Lys Tyr Ala Asp Glu Val Lys Ser His Leu Glu Lys Ile Leu Ser
35 40 45

Glu Ile Gln Thr Asn Leu Asp Arg Ser Lys His Ile Lys Thr Val Asn
50 55 60

Leu Ile Asn Lys Leu Gln Asp Ile Lys Arg Thr Tyr Leu Tyr Glu Leu
65 70 75 80

Asn Val Leu Glu Asp Lys Ser Lys Ala Glu Leu Pro Ser Lys Ile Lys
85 90 95

Ala Glu Leu Asp Ala Ala Phe Glu Gln Phe Lys Lys Asp Thr Leu Pro
100 105 110

Thr Glu Pro Gly Lys Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu
115 120 125

Ala Glu Lys Lys Ala Lys Ala Gln Lys Glu Glu Asp Tyr Arg Asn Tyr
130 135 140

Pro Thr Ile Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
145 150 155 160

Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Lys Glu Ala Asp
165 170 175

Glu Ser Arg Asn Glu Gly Thr Ile Asn Gln Ala Lys Ala Lys Val Glu
180 185 190

Ser Glu Gln Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Thr Asp Arg
195 200 205

Glu Lys Ala Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Glu Gln
210 215 220

Asp Glu Ser Lys Arg Arg Lys Ser Arg Val Lys Arg Gly Asp Phe Gly
225 230 235 240

Glu Pro Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp
245 250 255

Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser Leu Lys Pro Gly
260 265 270

Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Glu Lys Lys
275 280 285

Ala Lys Asp Gln Lys Glu Glu Asp His Arg Asn Tyr Pro Thr Ile Thr
290 295 300

Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys
305 310 315 320

Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Gly Ser Arg Asn
325 330 335

Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu Ser Lys Lys Ala
340 345 350

Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg Lys Lys Ala Glu
355 360 365

Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys Glu Lys
370 375 380

Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro Gln Pro Glu Lys Pro
385 390 395 400

Ala Pro Ala Pro Lys Pro Glu Asn Pro Ala Glu Gln Pro Lys Ala Glu
405 410 415

Lys Pro Ala Asp Gln Gln Ala Glu Glu
420 425

<210> 37 
<211> 439 
<212> PRT

<213> Streptococcus pneumoniae 
<400> 37 

Ala Ser Leu Phe Leu Gly Gly Val Val His Ala Glu Gly Val Arg Ser
1 5 10 15

Gly Asn Asn Ser Thr Val Thr Ser Ser Gly Gln Asp Ile Ser Lys Lys
20 25 30

Tyr Ala Asp Glu Val Glu Ser His Leu Gln Ser Ile Leu Lys Asp Val
35 40 45

Asn Lys Asn Leu Lys Lys Val Gln His Thr Gln Asn Ala Asp Phe Asn
50 55 60

Lys Lys Leu Ser Lys Ile Lys Thr Lys Tyr Leu Tyr Glu Leu Asn Val
65 70 75 80

Leu Glu Glu Lys Ser Glu Ala Glu Leu Thr Ser Lys Thr Lys Glu Thr
85 90 95

Lys Glu Glu Leu Thr Ala Ala Phe Glu Gln Phe Lys Lys Asp Thr Leu
100 105 110

Ser Thr Glu Pro Glu Lys Lys Val Ala Glu Ala Lys Lys Lys Val Glu
115 120 125

Glu Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Lys Asp Arg Arg Asn
130 135 140

Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser
145 150 155 160

Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Val Lys Ala
165 170 175

Asn Glu Pro Arg Asp Glu Glu Lys Ile Lys Gln Ala Glu Ala Lys Val
180 185 190

Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Thr Asp
195 200 205

Arg Glu Gln Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg
210 215 220

Glu Gln Ala Glu Glu Glu Ala Lys Val Lys Asp Glu Pro Lys Lys Arg
225 230 235 240

Thr Lys Arg Gly Val Leu Gly Glu Pro Ala Thr Pro Asp Lys Lys Glu
245 250 255

Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro
260 265 270

Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys
275 280 285

Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg
290 295 300

Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala
305 310 315 320

Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu
325 330 335

Glu Ala Lys Glu Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala
340 345 350

Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys
355 360 365

Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu
370 375 380

Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro
385 390 395 400


Ala Pro Gin Pro Glu Lys Pro Ala p r « r _ 

Ala Pro Lys Pro Glu Lys Pro Ala Pro 


405 


410 


415 


Ala Pro Lys Pro Glu Asn Pro Ala Glu Gln Pro Lys Ala Glu Lys Pro
420 425 430

Ala Asp Gln Gln Ala Glu Glu
435


<210> 38 
<211> 460 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 38 


Cys Ile Val Ala Ser Leu Val Met Gly Ser Val Val His Ala Thr Glu
1 5 10 15

Glu Gly Ala Thr Gln Val Pro Thr Ser Ser Asn Arg Ala Asn Glu
20 25 30

Ser Gln Ala Glu Gln Gly Glu Gln Pro Lys Lys Leu Asp Ser Glu Arg
35 40 45

Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys Ile Val Gly
50 55 60


Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr Ile Thr Val Ala
65 70 75 80

Leu Val Asn Glu Leu Asn Asn Ile Lys Asn Glu Tyr Leu Asn Lys Ile
85 90 95

Val Glu Ser Thr Ser Glu Ser Gln Leu Gln Ile Leu Met Met Glu Ser
100 105 110

Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp Ser Ser
115 120 125

Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser Asp Thr
130 135 140

Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala Glu Ala
145 150 155 160

Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu
165 170 175

Glu Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu Leu
180 185 190

Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu
195 200 205

Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gln Lys Ile Lys Gln
210 215 220

Ala Glu Ala Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu Lys
225 230 235 240

Lys Ile Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lys Arg Arg
245 250 255

Ala Asp Ala Lys Glu Gln Gly Lys Pro Lys Gly Arg Ala Lys Arg Gly
260 265 270

Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala Lys
275 280 285

Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser Leu
290 295 300

Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala
305 310 315 320

Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro
325 330 335

Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val
340 345 350

Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
355 360 365

Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu Ser
370 375 380

Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg Lys
385 390 395 400

Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val
405 410 415

Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro Lys Ala
420 425 430

Glu Lys Pro Ala Pro Ala Pro Lys Pro Glu Asn Pro Ala Glu Gln Pro
435 440 445

Lys Ala Glu Lys Pro Ala Asp Gln Gln Ala Glu Glu
450 455 460


<210> 39 
<211> 459 
<212> PRT 

<213> Streptococcus pneumoniae 


<400> 39 

Ile Val Ala Ser Leu Val Met Gly Ser Val Val His Ala Thr Glu Asn
1 5 10 15

Glu Gly Ala Thr Gln Val Pro Thr Ser Ser Asn Arg Ala Asn Glu Ser
20 25 30

Gln Ala Glu Gln Gly Glu Gln Pro Lys Lys Leu Asp Ser Glu Arg Asp
35 40 45

Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys Ile Val Gly Glu
50 55 60

Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr Ile Thr Val Ala Leu
65 70 75 80

Val Asn Glu Leu Asn Asn Ile Lys Asn Glu Tyr Leu Asn Lys Ile Val
85 90 95

Glu Ser Thr Ser Glu Ser Gln Leu Gln Ile Leu Met Met Glu Ser Arg
100 105 110

Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp Ser Ser Ser
115 120 125

Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser Asp Thr Ala
130 135 140

Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala Glu Ala Lys
145 150 155 160

Lys Lys Val Glu Glu Val Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu
165 170 175

Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu Leu Glu
180 185 190

Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val
195 200 205

Lys Val Lys Ala Asn Glu Pro Arg Asp Lys Gln Lys Ile Lys Gln Ala
210 215 220

Glu Ala Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu Lys Lys
225 230 235 240

Ile Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lys Arg Arg Ala
245 250 255

Asp Ala Lys Glu Gln Gly Lys Pro Lys Gly Arg Pro Lys Arg Gly Val
260 265 270

Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala Lys Ser
275 280 285

Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser Leu Lys
290 295 300

Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Lys
305 310 315 320

Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr
325 330 335

Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val Glu
340 345 350

Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Pro
355 360 365

Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu Ser Lys
370 375 380

Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg Lys Lys
385 390 395 400

Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys
405 410 415

Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro Lys Thr Glu
420 425 430

Lys Pro Ala Pro Ala Pro Lys Pro Glu Asn Pro Ala Glu Gln Pro Lys
435 440 445

Ala Glu Lys Pro Ala Asp Gln Gln Ala Glu Glu
450 455